THEORIES OF EVOLUTION


One of the major incentives in the pioneer studies of heredity and varia- 
tion which led to modern genetics was the hope of obtaining a deeper insight 
into the evolutionary process. Following the rediscovery of the Men- 
delian mechanism, there came a feeling that the solution of problems of 
evolution and of the control of the process, in animal and plant breeding 


Genetics 16: 97 Mr 1931 








98 SEWALL WRIGHT 


and in the human species, was at last well within reach. There has been 
no halt in the expansion of knowledge of heredity but the advances in 
the field of evolution have, perhaps, seemed disappointingly small. One 
finds the subject still frequently presented in essentially the same form as 
before 1900, with merely what seems a rather irrelevant addendum on 
Mendelian heredity. 

The difficulty seems to be the tendency to overlook the fact that the 
evolutionary process is concerned, not with individuals, but with the 
species, an intricate network of living matter, physically continuous in 
space-time, and with modes of response to external conditions which it 
appears can be related to the genetics of individuals only as statistical 
consequences of the latter. From a still broader viewpoint (compare LOTKA
1925) the species itself is merely an element in a much more extensive 
evolving pattern but this is a phase of the matter which need not concern 
us here. 

The earlier evolutionists, especially LAMARCK, assumed that the somatic 
effects of physiological responses of individuals to their environments were 
transmissible to later generations, and thus brought about a directed evo- 
lution of the species as a whole. The theory remains an attractive one to 
certain schools of biologists but the experimental evidence from genetics is 
so overwhelmingly against it as a general phenomenon as to render it 
unavailable in present thought on the subject. 

DARWIN was the first to present effectively the view of evolution as pri- 
marily a statistical process in which random hereditary variation merely 
furnishes the raw material. He emphasized differential survival and fe- 
cundity as the major statistical factors of evolution. A few years later, the 
importance of another aspect of group biology, the effect of isolation, was 
brought to the fore by WAGNER. Systematic biologists have continued to 
insist that isolation is the major species forming factor. As with natural 
selection, a connection with the genetics of individuals can be based on 
statistical considerations. 

There were many attempts in the latter part of the nineteenth century 
to develop theories of direct evolution in opposition to the statistical 
viewpoint. Most of the theories of orthogenesis (for example, those of 
EIMER and of COPE) implied the inheritance of “acquired characters.”
NÄGELI postulated a slow but self contained developmental process within
protoplasm; practically a denial of the possibility of a scientific treatment 
of the problem. Differing from these in its appeal to experimental evidence 
and from the statistical theories in its directness, was DE VRIES’ theory of
the abrupt origin of species by “mutations.” A statistical process, selection
or isolation, was indeed necessary to bring the new species into predominance,
but the center of interest, as with Lamarckism, was in the physiology of the
mutation process.

The rediscovery of Mendelian heredity in 1900 came as a direct con- 
sequence of DE VRIES’ investigations. Major Mendelian differences were 
naturally the first to attract attention. It is not therefore surprising that
the phenomena of Mendelian heredity were looked upon as confirming DE
VRIES’ theory. They supplemented the latter by revealing the possibilities
of hybridization as a factor bringing about an extensive recombination
of mutant changes and thus a multiplication of incipient species, a phase
emphasized especially by LOTSY. JOHANNSEN’s study of pure lines was
interpreted as meaning that DARWIN’s selection of small random variations
was not a true evolutionary factor.

A reaction from this viewpoint was led by CASTLE, who demonstrated
the effectiveness of selection of small variations in carrying the average of
a stock beyond the original limits of variation. This effectiveness turned
out to depend not so much on variability of the principal genes concerned
as on residual heredity. As genetic studies continued, ever smaller differences
were found to mendelize, and any character, sufficiently investigated,
turned out to be affected by many factors. The work of NILSSON-EHLE,
EAST, SHULL, and others established on a firm basis the multiple factor
hypothesis in cases of apparent blending inheritance of quantitative variation.

The work of MORGAN and his school securely identified Mendelian heredity
with chromosomal behavior and made possible researches which further
strengthened the view that the Mendelian mechanism is the general
mechanism of heredity in sexually reproducing organisms. The only ex- 
ceptions so far discovered have been a few plastid characters of plants. 
That differences between species, as well as within them, are Mendelian, 
in the broad sense of chromosomal, has been indicated by the close parallelism
between the frequently irregular chromosome behavior and the genetic
phenomena of species crosses (FEDERLEY, GOODSPEED and CLAUSEN,
etc.). Most of DE VRIES’ mutations have turned out to be chromosome
aberrations, of occasional evolutionary significance, no doubt, in increasing
the number of genes and in leading to sterility of hybrids and thus isolation,
but of secondary importance to gene mutation as regards character
changes. As to gene mutation, observation of those which have occurred 
naturally as well as of those which MULLER, STADLER, and others have 
recently been able to produce wholesale by X-rays, reveals characteris- 
tics which seem as far as possible from those required for a directly adaptive 
evolutionary process. The conclusion nevertheless seems warranted by
the present status of genetics that any theory of evolution must be based
on the properties of Mendelian factors, and beyond this, must be con- 
cerned largely with the statistical situation in the species. 


VARIATION OF GENE FREQUENCY 
Simple Mendelian equilibrium 


The starting point for any discussion of the statistical situation in 
Mendelian populations is the rather obvious consideration that in an 
indefinitely large population the relative frequencies of allelomorphic 
genes remain constant if unaffected by disturbing factors such as mutation, 
migration, or selection. If $[(1-q)a + qA]$ represents the frequencies of
two allelomorphs (a, A), the frequencies of the zygotes reach equilibrium
according to the expansion of $[(1-q)a + qA]^2$ within at least two generations,¹
whatever the initial composition of the population (HARDY 1908).
Combinations of different series are in equilibrium when these are com- 
bined at random, but as WEINBERG (1909) and later, in more detail, 
ROBBINS (1918) have shown, equilibrium is not reached at once but is
approached asymptotically through an infinite number of generations.
Linkage slows down the approach to equilibrium but has no effect on the 
ultimate frequencies. 
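
The equilibrium for a single pair of allelomorphs can be checked numerically. The following Python sketch is illustrative only: it assumes random mating, no overlap of generations, and equal gene frequencies in the two sexes (under which the zygotic proportions settle after one generation). The function names and the starting proportions are hypothetical.

```python
def gene_frequency(zygotes):
    """Frequency q of gene A, given (aa, Aa, AA) proportions summing to 1."""
    aa, het, AA = zygotes
    return AA + het / 2.0


def random_mating(zygotes):
    """Zygote proportions (aa, Aa, AA) after one generation of random mating."""
    q = gene_frequency(zygotes)
    return ((1 - q) ** 2, 2 * q * (1 - q), q ** 2)


# Hypothetical starting population: an equal mixture of two pure strains.
start = (0.5, 0.0, 0.5)
gen1 = random_mating(start)
gen2 = random_mating(gen1)
print(gen1, gen2)
```

The gene frequency is the same in every generation, and the zygotic array no longer changes once it has taken the form of the squared binomial.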


Mutation pressure 


The effects of different simple types of evolutionary pressure on gene 
frequencies are easily determined. Irreversible mutation of a gene at the 
rate u per generation changes gene frequency (q) at the rate $\Delta q = -uq$.
With reverse mutation at rate v the change in gene frequency is
$\Delta q = -uq + v(1-q)$. In the absence of other pressures, an equilibrium is reached
between the two mutation rates when $\Delta q = 0$, giving $q = v/(u+v)$.
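
As a rough check, the recurrence can be iterated until the frequency settles. This is a sketch under the assumption of constant rates u and v; the numerical values and function names are hypothetical.

```python
def mutate(q, u, v):
    """One generation of mutation pressure: dq = -u*q + v*(1 - q)."""
    return q + (-u * q + v * (1 - q))


def mutation_equilibrium(u, v):
    """Frequency at which the two mutation rates balance."""
    return v / (u + v)


u, v = 1e-3, 2.5e-4      # hypothetical mutation rates
q = 1.0                  # start with gene A fixed
for _ in range(20000):
    q = mutate(q, u, v)
print(q, mutation_equilibrium(u, v))
```

The departure from equilibrium shrinks by the factor (1 - u - v) each generation, so with rates of this order the approach takes thousands of generations.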


Migration pressure 


The frequency of a gene in a given population may be modified by migration
as well as by mutation. As an ideal case, suppose that a large population
with average frequency $q_m$ for a particular gene is composed of subgroups,
each exchanging the proportion m of its population with a random
sample of the whole population. For such a subgroup, $\Delta q = -m(q - q_m)$.

The conditions postulated above are rather artificial since, in an actual
species, subgroups exchange individuals with neighboring subgroups rather
than with a random sample of the whole species and the change in q will be
only a fraction of that given above. The fraction is the average degree of
departure of the neighboring subgroups toward the population average.
The formula may be retained by letting $q_m$ stand for the gene frequency of
immigrants rather than of the whole species.

¹ This statement assumes that there is no overlapping of generations which may bring about
some delay in the attainment of equilibrium.
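
The migration formula implies that the departure of a subgroup from the immigrant frequency shrinks by the factor (1 - m) each generation. The Python sketch below compares direct iteration with that closed form. It assumes constant m and a constant immigrant frequency; the names and values are hypothetical.

```python
def migrate(q, m, q_m):
    """One generation of migration pressure: dq = -m * (q - q_m)."""
    return q - m * (q - q_m)


def after_generations(q0, m, q_m, t):
    """Closed form implied by the recurrence: (q - q_m) is multiplied by (1 - m) each time."""
    return q_m + (q0 - q_m) * (1 - m) ** t


q = 0.9                                  # hypothetical initial subgroup frequency
for _ in range(50):
    q = migrate(q, m=0.05, q_m=0.4)
print(q, after_generations(0.9, 0.05, 0.4, 50))
```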


Selection pressure 


Selection, whether in mortality, mating or fecundity, applies to the organism
as a whole and thus to the effects of the entire gene system rather
than to single genes. A gene which is more favorable than its allelomorph 
in one combination may be less favorable in another. Even in the case of 
cumulative effects, there is generally an optimum grade of development 
of the character and a given plus gene will be favorably selected in combi- 
nations below the optimum but selected against in combinations above the 
optimum. Again the greater the number of unfixed genes in a population, 
the smaller must be the average effectiveness of selection for each one of 
them. The more intense the selection in one respect, the less effective it 
can be in others. The selection coefficient for a gene is thus in general a 
function of the entire system of gene frequencies. As a first approximation, 
relating to a given population at a given moment, one may, however, assume
a constant net selection coefficient for each gene. Assume that the
genes a and A tend to be reproduced in the ratio $(1-s):1$ per generation.
The gene array $[(1-q)a + qA]$ becomes $[(1-s)(1-q)a + qA]/[1 - s(1-q)]$.
The change in the frequency of A is $\Delta q = sq(1-q)/[1 - s(1-q)]$ or, with
sufficiently close approximation, $\Delta q = sq(1-q)$ if the selection coefficient is
small.
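
The closeness of the approximation can be judged with a short Python sketch. It assumes a constant genic selection coefficient s, as in the text; the function names and the numerical values are hypothetical.

```python
def next_frequency(q, s):
    """Frequency of A after a and A reproduce in the ratio (1 - s) : 1."""
    return q / (1 - s * (1 - q))


def delta_q_exact(q, s):
    """Exact change in the frequency of A under genic selection."""
    return s * q * (1 - q) / (1 - s * (1 - q))


def delta_q_approx(q, s):
    """Approximation valid for a small selection coefficient."""
    return s * q * (1 - q)


q, s = 0.3, 0.01         # hypothetical values
print(delta_q_exact(q, s), delta_q_approx(q, s))
```

The two expressions differ only by the factor 1/[1 - s(1 - q)], so the relative error of the approximation never exceeds the order of s itself.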

A second approximation may be obtained by considering the zygotic
frequencies. Assume that the types aa, Aa, and AA reproduce in the
ratio $(1-s'):(1-hs'):1$ per generation. The change in the frequency of
A to a sufficiently close approximation is $\Delta q = s'q(1-q)[1 - q + h(2q-1)]$
(WRIGHT 1929). In the case of selection for or against a complete recessive
($h = 0$, $s'$ negative or positive respectively), $\Delta q = s'q(1-q)^2$.

The case of no dominance ($h = 1/2$) is the same as the case of genic selection
except that the selection against the gene is $s'/2$ instead of s.
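
The zygotic formula can be compared with an exact one-generation calculation. The sketch below assumes random mating, so that the zygotes aa, Aa, AA occur as (1-q)², 2q(1-q), q² before selection; that assumption, the function names and the numerical values are illustrative and not taken from the text.

```python
def mean_fitness(q, s, h):
    """Average reproductive rate when aa, Aa, AA reproduce as (1-s) : (1-h*s) : 1."""
    return 1 - s * (1 - q) ** 2 - 2 * h * s * q * (1 - q)


def delta_q_exact(q, s, h):
    """Exact one-generation change in the frequency of A under random mating."""
    q_next = (q * q + q * (1 - q) * (1 - h * s)) / mean_fitness(q, s, h)
    return q_next - q


def delta_q_approx(q, s, h):
    """Approximation: s*q*(1-q)*[1 - q + h*(2q - 1)]."""
    return s * q * (1 - q) * (1 - q + h * (2 * q - 1))


q, s = 0.4, 0.02         # hypothetical values
for h in (0.0, 0.5, 1.0):
    print(h, delta_q_exact(q, s, h), delta_q_approx(q, s, h))
```

Under these assumptions the exact change is the approximate one divided by the mean reproductive rate, which is close to 1 when the selection coefficient is small.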

The two factor case in which the phenotypes aabb, aaB-, A-bb and
A-B- reproduce at the rates $(1-s_{ab}):(1-s_a):(1-s_b):1$ respectively
yields (for low values of the selection coefficients):

$$\Delta q_A = q_A(1-q_A)^2\,[\,s_a + (s_{ab} - s_a - s_b)(1-q_B)^2\,].$$

The frequency of A depends on the frequency and selection of B, becoming
independent only if $s_{ab} = s_a + s_b$, that is, if the two series of genes are
cumulative with respect to selection. It does not seem profitable to pursue 
this subject further for the purpose of the present paper, since in the gen- 
eral case, each selection coefficient is a complicated function of the entire 
system of gene frequencies and can only be dealt with qualitatively. 
Attention may, however, be called to HALDANE’s (1924-1927) studies of 
selection rates and of the consequent number of generations required for 
unopposed selection to bring about any required change in gene frequency 
under various assumptions with respect to mode of inheritance and system 
of mating. 
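
The dependence of one series on the other in the two factor case can be illustrated with a small Python sketch. It uses the low-selection approximation for the change in the frequency of A; the function name and the numerical coefficients are hypothetical.

```python
def delta_q_a(q_a, q_b, s_a, s_b, s_ab):
    """Approximate change in the frequency of A in the two factor case.

    s_a, s_b and s_ab are the selection coefficients against the phenotypes
    aaB-, A-bb and aabb respectively.
    """
    return q_a * (1 - q_a) ** 2 * (s_a + (s_ab - s_a - s_b) * (1 - q_b) ** 2)


# Cumulative case (s_ab = s_a + s_b): the value does not depend on q_b.
print(delta_q_a(0.3, 0.1, 0.02, 0.03, 0.05), delta_q_a(0.3, 0.9, 0.02, 0.03, 0.05))

# Non-cumulative case: the same gene is favored or opposed according to q_b.
print(delta_q_a(0.3, 0.1, 0.02, 0.03, 0.01), delta_q_a(0.3, 0.9, 0.02, 0.03, 0.01))
```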


Equilibrium under selection 


There may be equilibrium between allelomorphs as a result wholly of
selection, namely, selection against both homozygotes in favor of the
heterozygous type. Putting $\Delta q = s'q(1-q)[1 - q + h(2q-1)] = 0$ gives
$q = (1-h)/(1-2h)$ as the condition.² This includes the case of selection
against both homozygotes and also that in favor of them, but examination
of the signs of $\Delta q$ above and below the equilibrium point shows that only
the former is in stable equilibrium, in agreement with FISHER (1922). The
linkage of a favorable dominant with an unfavorable recessive of another
series is a case in which selection would be against both homozygotes as
JONES (1917) has pointed out, and stressed as a factor in the vigor of heterozygosis.
In a population produced by the intermingling of types in which
different deleterious recessives have become fixed, there will be a tem- 
porary selection in favor of the heterozygotes even without any linkage at 
all. Unless the linkage is very strong, however, this effect does not persist 
long enough to have much effect on gene equilibrium. The extreme case of 
equilibrium of the sort discussed here is, of course, that of balanced lethals, 
found in nature in Oenothera. 
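
The stability of the two kinds of equilibrium can be illustrated by iterating an exact one-generation recurrence. This Python sketch assumes random mating and reproductive rates (1-s') : (1-hs') : 1 for aa, Aa, AA; the particular values of h and s', and the function names, are hypothetical.

```python
def next_q(q, s, h):
    """Frequency of A after one generation of zygotic selection under random mating."""
    w = 1 - s * (1 - q) ** 2 - 2 * h * s * q * (1 - q)
    return (q * q + q * (1 - q) * (1 - h * s)) / w


def equilibrium(h):
    """Equilibrium frequency q = (1 - h) / (1 - 2h)."""
    return (1 - h) / (1 - 2 * h)


def iterate(q, s, h, generations=5000):
    for _ in range(generations):
        q = next_q(q, s, h)
    return q


# Heterozygote favored (h negative): both starting points approach the equilibrium.
print(equilibrium(-0.5), iterate(0.10, 0.1, -0.5), iterate(0.95, 0.1, -0.5))

# Heterozygote at a disadvantage (h greater than 1): the equilibrium is unstable.
print(equilibrium(1.5), iterate(0.24, 0.1, 1.5), iterate(0.26, 0.1, 1.5))
```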


² The condition can be expressed in a more symmetrical form by using a different form of
statement of the selection coefficients. Assume that the rates of reproduction of the three types
aa, Aa and AA are as $(1-s_a):1:(1-s_A)$. The value of q at equilibrium comes out
$q = s_a/(s_a + s_A)$, with stable equilibrium only for positive values of the two selection coefficients.

In the two factor case, discussed in the preceding section,

$$\Delta q_A = 0 \ \text{ if } \ q_B = 1 - \sqrt{\frac{s_a}{s_a + s_b - s_{ab}}}$$

and

$$\Delta q_B = 0 \ \text{ if } \ q_A = 1 - \sqrt{\frac{s_b}{s_a + s_b - s_{ab}}}.$$

There may be equilibrium here, if $s_a$ and $s_b$ are alike in sign, and $s_{ab}$ is
either opposite in sign or of the same sign and smaller, but it is an unstable
equilibrium. Of more general importance, perhaps, is the equilibrium
reached by a deleterious mutant gene. For mutation opposed by genic
selection $\Delta q = -uq + sq(1-q) = 0$, $q = 1 - u/s$. For mutation opposed by
zygotic selection (aa, Aa and AA reproducing at rates $(1-s'):(1-hs'):1$)
it is easily shown that $q = 1 - u/(hs')$ (WRIGHT 1929), unless h approaches 0.
Thus with no dominance, $q = 1 - 2u/s'$, and for selection against a dominant
mutation, $q = 1 - u/s'$. The important case of selection against a
recessive is that in which $h = 0$. The formula becomes $q = 1 - \sqrt{u/s'}$. All
of these cases are illustrated in figure 1 in which the ordinates show the 
selection pressure as related to factor frequency, under the different condi- 
tions of selection. The intersections with the straight line representing 
mutation pressure give the points of equilibrium. If deleterious dominant 
and recessive mutations occur with equal frequency and are subject to the 
same degree of selection, the frequency of the recessive mutant genes will 
be greater than that of the dominant ones in the ratio $\sqrt{u/s'}$ to $u/s'$. The
corresponding figure for factors lacking dominance is $2u/s'$, where $s'$ is the
selection against the homozygote. These considerations alone should lead 
to a marked correlation in nature between recessiveness and deleterious 
effect. This correlation is further increased by the greater frequency of
recessive mutation which seems to be a general phenomenon. It is this
correlation which gives the theoretical basis for the immediate degeneration
which usually accompanies inbreeding, a process which increases the
proportion of recessive phenotypes.
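
The equilibrium frequencies of a deleterious mutant under the three conditions of dominance can be found numerically and set against the formulae. The Python sketch below locates the root of the combined mutation and selection pressure by bisection; the rates chosen, the search interval and the function names are hypothetical.

```python
from math import sqrt


def net_pressure(q, u, s, h):
    """Mutation away from A at rate u plus approximate zygotic selection on A."""
    return -u * q + s * q * (1 - q) * (1 - q + h * (2 * q - 1))


def equilibrium(u, s, h, lo=0.5, hi=1.0, steps=200):
    """Bisection for the frequency of A at which net_pressure vanishes."""
    for _ in range(steps):
        mid = (lo + hi) / 2
        if net_pressure(mid, u, s, h) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


u, s = 1e-5, 1e-2                              # hypothetical rates
recessive = 1 - equilibrium(u, s, h=0.0)       # mutant frequency, about sqrt(u/s)
no_dominance = 1 - equilibrium(u, s, h=0.5)    # about 2u/s
dominant = 1 - equilibrium(u, s, h=1.0)        # about u/s
print(recessive, no_dominance, dominant)
```

With these illustrative rates the recessive mutant is held at a frequency some thirty times that of an equally deleterious dominant.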

The amount to which gene frequency in a subgroup may depart from 
the species average as a result of local selection held in check by population
interchange with other regions may be calculated by solving the quadratic
$\Delta q = sq(1-q) - m(q - q_m) = 0$. If the local selection coefficient is much
greater than the proportion of migration ($s \gg m$), $q = 1 - \frac{m}{s}(1 - q_m)$ or
$q = -mq_m/s$ depending on the direction of selection, formulae analogous to
those for the equilibrium between mutation and selection. If, on the other
hand, selection is weak compared with migration ($s \ll m$), the departure
from $q_m$ is small and $q = q_m[1 + \frac{s}{m}(1 - q_m)]$. This case is doubtless the
more important in nature. Large subgroups living under different selection
pressures should show gene frequencies clustering about the average
according to this expression. The effect of small size of the subgroups in
bringing about random deviation in this and other cases is not here considered.
The case in which s and m are of the same order of magnitude may
be illustrated by the case of exact equality. Here $q = \sqrt{q_m}$ or $q = 1 - \sqrt{1 - q_m}$,
depending on the direction of selection.
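
The quadratic can also be solved directly and the limiting formulae checked against the exact root. In the Python sketch below the choice of root (the one lying between 0 and 1) and the numerical values are assumptions made for illustration.

```python
from math import sqrt


def equilibrium(s, m, q_m):
    """Root in [0, 1] of s*q*(1 - q) - m*(q - q_m) = 0, for s not equal to 0."""
    disc = sqrt((s - m) ** 2 + 4 * s * m * q_m)
    return ((s - m) + disc) / (2 * s)


q_m = 0.3                                              # hypothetical immigrant frequency
print(equilibrium(0.1, 0.001, q_m), 1 - (0.001 / 0.1) * (1 - q_m))         # s much greater than m
print(equilibrium(-0.1, 0.001, q_m), 0.001 * q_m / 0.1)                    # strong adverse selection
print(equilibrium(0.001, 0.1, q_m), q_m * (1 + (0.001 / 0.1) * (1 - q_m))) # s much smaller than m
print(equilibrium(0.01, 0.01, 0.25), sqrt(0.25))                           # exact equality
```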


Multiple allelomorphs


The foregoing discussion has dealt formally only with pairs of allelomorphs,
a wholly inadequate basis for consideration of the evolutionary process
unless extension can be made to multiple allelomorphs. Among the
laboratory rodents some 40 percent of the known series of factors affecting
coat color are already known to be multiple. The number of multiple
series is large in other organisms, for example, Drosophila (MORGAN,
STURTEVANT and BRIDGES 1925). It is not unlikely that further study will
indicate that all series are potentially multiple. In this case, each gene has a
history which is not a mere oscillation between approximate fixation of
two conditions but a real evolutionary process in its internal structure.
Presumably any particular gene of such an indefinitely extended series can
arise at a step from only a few of the others³ and in turn mutate to only a
few. Since genes as a rule have multiple effects and change in one effect
need not involve others, it is probable that in time a gene may come to
produce its major effects on wholly different characters than at first.
Continuing this line of thought, it indeed seems possible that all genes of all
organisms may ultimately be traced to a common source, mitotic
irregularities furnishing the basis for multiplication of genes.

[Figure 1 plot; recoverable labels: Pressure, Equilibrium.]

FIGURE 1. Rate of change of gene frequency under selection or mutation. Genic selection
(A, a reproducing at rates $1 : 1-s$); zygotic selection: dominant (B-, bb at rates $1 : 1-s$), recessive
(cc, C- at rates $1 : 1-s$), intermediate (DD, Dd, dd at rates $1 : 1-\tfrac{1}{2}s : 1-s$). Mutation such that
$u = -0.05s$. Intersections of line of mutation pressure with those for selection pressure
determine the equilibrium frequencies.

The relative frequencies of all allelomorphs in a series tend, of course, 
to remain constant in the absence of disturbing forces. The zygotes reach 
the equilibrium of random combination of the genes in pairs by the second 
generation from any initial constitution of the population. The effects of 
the various kinds of evolutionary pressure on the frequency of each gene 
may be treated as before by contrasting each gene with the totality of its 
allelomorphs. In the binomial expression [(1—q)a+qA], A may be under- 
stood as representing any gene, and a as including all others of its series. 
Such treatment, however, requires further qualification with regard to 
the constancy of the various coefficients. It may still be assumed that the 
rate (u) of mutational breakdown of the gene in question (A) is reasonably 
constant, but its rate (v) of mutational origin from allelomorphs must be 
expected to change. This may be expected to rise to a maximum, as genes 
closely allied to A in structure become frequent, and to fall off to zero as 
changes accumulate in the locus. Even at its maximum, however, the rate 
of formation should in general be of the second order compared with the rate 
of change to something else, simply because it is one and its alternatives 
many. Moreover, there is an indication that the genes which become more 
or less established in a population are not a random sample of the types of 
mutations which occur. It has been the common experience that mutations 
are usually recessive. Recessiveness is most simply interpreted physiologi- 
cally as due to inactivation which may well be the commonest type of 
mutational change. But the evolutionary process presumably involves increase
in activity of genes at least as frequently as inactivation with the
consequence that the rate of formation (v) of genes of evolutionary significance
becomes negligibly small in comparison with rate of breakdown
(u) of such genes. It should be said that FISHER has advanced an alternative
hypothesis according to which genes originally without dominance become
dominant through a process of selection of modifiers (FISHER 1928,
1929, WRIGHT 1929, 1929a).

³ Those most closely related genetically, however, need not always be closest in effect.
The complete inactivation of a gene in a particular respect may for example occur more freely
than a partial inactivation.

The selection coefficient, s, relating to a gene A cannot be expected to 
be constant if the alternative term a includes more than one gene. The 
coefficient should rise to a maximum positive value as A replaces less use- 
ful genes but should fall off and ultimately become negative as the group 
of allelomorphs comes to include still more useful genes. But as already 
discussed, even if A has only one allelomorph, the dependence of the 
selection coefficient on the frequencies and selection coefficients of
non-allelomorphs keeps it from being constant. The existence of multiple
allelomorphs merely adds another cause of variation. 


Random variation of gene frequency 


There remains one factor of the greatest importance in understanding 
the evolution of a Mendelian system. This is the size of the population. 
The constancy of gene frequencies in the absence of selection, mutation or 
migration cannot for example be expected to be absolute in populations of 
limited size. Merely by chance one or the other of the allelomorphs may 
be expected to increase its frequency in a given generation and in time the 
proportions may drift a long way from the original values. The decrease 
in heterozygosis following inbreeding is a well known statistical conse- 
quence of such chance variation. The extreme case is that of a line propa- 
gating by self fertilization which may be looked upon as a self contained 
population of one. In this case, 50 percent of the factors with equal rep- 
resentation of two allelomorphs (that is, in which the individual is hetero- 
zygous) shift to exclusive representation of one of the allelomorphs in 
the following generation merely as a result of random sampling among the 
gametes. From the series of fractions given by JENNINGS (1916) for the 
change in heterozygosis under brother-sister mating (population of two) it 
may be deduced that the rate of loss in this case is a little less than 20 per- 
cent per generation. A general method for determining the decrease in 
heterozygosis under inbreeding has been presented in a previous paper 
(WRIGHT 1921). It can be shown from this that there is a rate of loss of about
1/2N in the case of a breeding population of N individuals whether equally
divided between males and females or composed of monoecious individuals,
assuming pairs of allelomorphs. HAGEDOORN (1921) has urged the
importance of such random fixation as a factor in evolution.

EVOLUTION IN MENDELIAN POPULATIONS 107

Another phase of this question was opened by FISHER (1922) who attempted
to discover the distribution of gene frequencies ultimately reached
in a population as a result of the above process. He studied a number of 
conditions relative to mutation and selection. He does not state the rate 
of decrease in heterozygosis (where any) which would follow from the solu- 
tions which he reached but this can be deduced very directly from them. 
It comes out 1/4N for a population of N breeding individuals in the absence 
of selection or mutation. This is just half the rate indicated by the method 
referred to above. 


FIGURE 2. 


Rate of decrease in heterozygosis 


The following symbols and formulae were used in the previous paper in 
determining the consequences of systems of inbreeding. Primes were used 
to indicate the number of generations preceding the one in question. Only 
pairs of allelomorphs are considered here. 

M   correlation between genotypes of mates

b $(= \sqrt{\tfrac{1}{2}(1+F')})$   path coefficient, zygote to gamete

a $(= \sqrt{1/[2(1+F)]})$   path coefficient, gamete to fertilized egg

F $(= b^2M)$   correlation between uniting egg and sperm, also, total proportional
change in heterozygosis.

P $(= 2q(1-q)(1-F))$   proportion of heterozygosis.

The general formula for the correlation between uniting gametes is easily 
deduced and has been used as a coefficient of inbreeding in dealing with 
complex livestock pedigrees (WRIGHT 1922, 1923, 1925, McPHEE and 
WRIGHT 1925, 1926),

$$F = \sum\left[\left(\tfrac{1}{2}\right)^{n_s + n_d + 1}(1 + F_A)\right].$$

Here $F_A$ is the coefficient of inbreeding of any common ancestor that
makes the connecting link between a line of ancestry tracing back from the
sire and one tracing back from the dam. The numbers of generations from
sire and dam to such a common ancestor are designated $n_s$ and $n_d$
respectively. The contribution of a particular tie between the pedigrees of sire
and dam is $(\tfrac{1}{2})^{n_s + n_d + 1}(1 + F_A)$ and the total coefficient is simply the sum
of all such contributions. This formula makes it possible to compare quan- 
titatively the statistical situation in actual populations with that in ideal 
populations. 
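
The summation over ties is easily mechanized. The Python sketch below takes each tie as a triple (n_s, n_d, F_A); this representation, the function name and the example pedigrees are hypothetical, and every tie is assumed to have been enumerated by hand.

```python
def inbreeding_coefficient(ties):
    """Sum of (1/2)**(n_s + n_d + 1) * (1 + F_A) over all ties.

    Each tie is (n_s, n_d, f_a): generations from sire and from dam to a
    common ancestor, and that ancestor's own coefficient of inbreeding.
    """
    return sum(0.5 ** (n_s + n_d + 1) * (1 + f_a) for n_s, n_d, f_a in ties)


# Offspring of full brother and sister: two common ancestors, one generation back on each side.
full_sibs = inbreeding_coefficient([(1, 1, 0.0), (1, 1, 0.0)])

# Offspring of half brother and sister: one common ancestor.
half_sibs = inbreeding_coefficient([(1, 1, 0.0)])

# The same half-sib mating when the common ancestor is itself inbred (F_A = 0.25).
half_sibs_inbred = inbreeding_coefficient([(1, 1, 0.25)])

print(full_sibs, half_sibs, half_sibs_inbred)
```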

In dealing with regular systems of mating the method of analysis con- 
sists in expressing the correlation between mated individuals in terms of 
path coefficients and correlations pertaining to the preceding generation 
($M = \phi(a', b', M')$) and from this obtaining an expression for F in terms of the
F’s of the preceding generations. 

Consider a population in which there are $N_m$ breeding males and $N_f$
breeding females, and random mating. The proportion of matings between
full brother and sister will be $1/(N_m N_f)$, that between half brother and sister
$(N_m + N_f - 2)/(N_m N_f)$, and that between less closely related individuals
$(N_m - 1)(N_f - 1)/(N_m N_f)$. The correlation between mated individuals may
be written as follows, giving due weight to these three possibilities:

$$M = a'^2 b'^2\left[\frac{1}{N_m N_f}(2 + 2M') + \frac{N_m + N_f - 2}{N_m N_f}(1 + 3M') + \frac{(N_m - 1)(N_f - 1)}{N_m N_f}\,4M'\right].$$

This leads to the following formula for proportional change in heterozygosis
since the foundation period:

$$F = F' + \left(\frac{N_m + N_f}{8 N_m N_f}\right)(1 - 2F' + F'').$$

The proportion of heterozygosis may be written, relative to that of preceding
generations:

$$P = P' - \left(\frac{N_m + N_f}{8 N_m N_f}\right)(2P' - P'').$$


It is to be expected that the proportional change per generation will reach
approximate constancy. This rate may be found by equating $P/P'$ to $P'/P''$:

$$1 - \frac{P}{P'} = \frac{1}{2}\left[1 + \frac{N_m + N_f}{4 N_m N_f} - \sqrt{1 + \left(\frac{N_m + N_f}{4 N_m N_f}\right)^2}\,\right].$$

This gives $(1/8N_m + 1/8N_f)(1 - 1/8N_m - 1/8N_f)$ as a close approximation
even for the smallest populations while for reasonably large ones the
form $1/8N_m + 1/8N_f$ is sufficiently accurate.

The simplest case is that of continued mating of brother with sister 
($N_m = N_f = 1$). The rate of loss of heterozygosis comes out $\tfrac{1}{4}(3 - \sqrt{5})$ or
19.1 percent per generation. The formula for proportion of heterozygosis
takes the form $P = \tfrac{1}{2}P' + \tfrac{1}{4}P''$ as given in the previous paper, with
results in exact agreement with those derived by JENNINGS (1916) by 
working out in detail the consequences of every possible mating from gen- 
eration to generation. 

Another simple case is that in which one male is mated with an indef- 
initely large number of half-sisters. This is approximately the system of 
breeding continuously within one herd, headed always by just one male. 
In this case $N_m = 1$, $N_f = \infty$, with rate of loss of heterozygosis of 11.0 percent
per generation in agreement with previous results (WRIGHT 1921).

With a relatively limited number of males but unlimited number of females,
the rate becomes approximately $1/8N_m$.

An especially important case is that in which the population is equally 
divided between males and females. Here $N_m = N_f = \tfrac{1}{2}N$ and the rate of
loss is approximately 1/2N (or somewhat more closely 1/(2N+1)) where 
N is the total size of the breeding population. 
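
The special cases quoted above can be reproduced numerically. The Python sketch below assumes the recurrence P = P' - c(2P' - P''), with c = 1/8N_m + 1/8N_f, and compares the limiting rate of loss obtained by iteration with the root of the corresponding quadratic. The function names are hypothetical, and an unlimited number of females is represented by `math.inf`.

```python
import math


def loss_rate_closed_form(n_m, n_f):
    """Limiting rate of loss, from x**2 - (1 - 2c)*x - c = 0 with x = P/P'."""
    two_c = 1 / (4 * n_m) + 1 / (4 * n_f)
    return 0.5 * (1 + two_c - math.sqrt(1 + two_c * two_c))


def loss_rate_by_iteration(n_m, n_f, generations=200):
    """Iterate P = P' - c*(2P' - P'') and return 1 - P/P' for the last generation."""
    c = 1 / (8 * n_m) + 1 / (8 * n_f)
    p_before, p_last = 1.0, 1.0
    for _ in range(generations):
        p_new = p_last - c * (2 * p_last - p_before)
        p_before, p_last = p_last, p_new
    return 1 - p_last / p_before


print(loss_rate_closed_form(1, 1))           # brother-sister mating
print(loss_rate_closed_form(1, math.inf))    # one male, unlimited females
print(loss_rate_closed_form(50, 50))         # N = 100, equally divided
```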

It is not, perhaps, clear at first sight that a population of N monoecious 
organisms, in which self fertilization is prevented, should show a decrease 
in heterozygosis exactly equal to that in a population of the same size 
equally divided between males and females. The chance that uniting 
gametes come from full sisters is $2/[N(N-1)]$, the chance that they come
from half sisters is $4(N-2)/[N(N-1)]$ while the chance that they come
from less closely related individuals is $(N-2)(N-3)/[N(N-1)]$, giving

$$M = \frac{a'^2 b'^2}{N(N-1)}\left[2(2 + 2M') + 4(N-2)(1 + 3M') + (N-2)(N-3)\,4M'\right]$$

$$P = P' - \frac{1}{2N}(2P' - P'')$$


exactly as in the preceding case. 

The somewhat arbitrary case in which the gametes produced by N monoecious
individuals unite wholly at random is that which can be compared
directly with FISHER’s results. The gametes have a chance 1/N of coming
from the same individual and of $(N-1)/N$ of coming from different individuals.
The correlation between uniting gametes may thus be written

As might be expected, the result does not differ appreciably from that
of the preceding case. The rate of loss of heterozygosis is exactly 1/2N in- 
stead of merely approximately this figure. The simplest special case is, of 
course, continued self fertilization in which N =1 and the formula gives 
the obviously correct result of 50 percent loss of heterozygosis per genera- 
tion. 
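
The rate 1/2N for wholly random union of gametes can be verified by an exact binomial calculation. The Python sketch below assumes that the 2N genes of the next generation are an independent random sample from the present gene frequency, and it measures heterozygosis by 2q(1-q); both choices, and the function name, are illustrative.

```python
from math import comb


def expected_heterozygosis_next(k, n):
    """Expected 2q'(1 - q') after random sampling of 2n gametes.

    k is the present number of A genes among the 2n genes of n individuals.
    """
    g = 2 * n
    q = k / g
    total = 0.0
    for j in range(g + 1):
        prob = comb(g, j) * q ** j * (1 - q) ** (g - j)   # binomial chance of j copies of A
        q_next = j / g
        total += prob * 2 * q_next * (1 - q_next)
    return total


n, k = 10, 7
before = 2 * (k / (2 * n)) * (1 - k / (2 * n))
after = expected_heterozygosis_next(k, n)
print(before, after, 1 - after / before)      # the last value is 1/2N

# Self fertilization (N = 1) of a heterozygote.
print(expected_heterozygosis_next(1, 1))
```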

From the mode of analysis it might be thought that the loss in heterozy- 
gosis is wholly the consequence of the occasional matings between very 
close relatives. This, however, is not the case. If instead of random sam- 
pling of the gametes produced by the population it is assumed that all 
individuals reproduce equally and that inbreeding is consistently avoided 
as much as possible, the percentage of heterozygosis still falls off. The rate 
of loss is, however, only about half as rapid (approximately 1/4N) in a 
reasonably large population equally divided between males and females. 
The cases of N = 2, 4, 8 and 16 have been given previously (WRIGHT 1921). 

In dealing with heterozygosis in the foregoing, it has been assumed for 
simplicity that each locus was represented by only two allelomorphs in 
the population in question and that either complete fixation or complete 
loss of a particular gene means homozygosis of all individuals with re- 
spect to the locus. But in any case beyond that of self fertilization, more 
than two allelomorphs may be present and complete loss of the gene no 
longer implies homozygosis of the locus. The initial rate of loss of hetero- 
zygosis in a large population may thus be only 1/4N with gradual approach 
to 1/2N as the number of loci with only two remaining allelomorphs in- 
creases. The rate of decay of the distribution of gene frequencies is 1/2N 
regardless of number of allelomorphs. 


The population number 


It will be well to discuss more fully, before going on, what is to be under- 
stood by the symbol N used here for population number. The conception 
is that of two random samples of gametes, N sperms and N eggs, drawn 
from the total gametes produced by the generation in question (N/2 males 
and N/2 females each with a double representation from each series of al- 
lelomorphs). Obviously N applies only to the breeding population and 
not to the total number of individuals of all ages. If the population fluctuates
greatly, the effective N is much closer to the minimum number than
to the maximum number. If there is a great difference between the number
of mature males and females, it is closer to the smaller number than to the
larger. In fact, as just shown, a population of N_m males and an indefinitely
large number of females is approximately equivalent to a population of
4N_m individuals equally divided between males and females.
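
The statement on unequal numbers of the sexes can be illustrated by a minimal sketch. It assumes the relation 1/N = 1/(4N_m) + 1/(4N_f), which gives the limiting value 4N_m quoted above when females are indefinitely numerous. The harmonic-mean treatment of fluctuating numbers is an added assumption, not a formula stated in this section; it is included only because it shows the pull toward the minimum number. All function names are illustrative.

```python
def effective_n_sexes(n_males, n_females):
    """Effective N for unequal numbers of breeding males and females.

    Assumes 1/N = 1/(4*N_m) + 1/(4*N_f); tends to 4*N_m as N_f grows.
    """
    return 4.0 * n_males * n_females / (n_males + n_females)


def effective_n_fluctuating(sizes):
    """Harmonic mean of per-generation sizes (an assumed formalization)."""
    return len(sizes) / sum(1.0 / n for n in sizes)


if __name__ == "__main__":
    print(effective_n_sexes(50, 50))                  # equal sexes
    print(effective_n_sexes(10, 10**9))               # females indefinitely numerous
    print(effective_n_fluctuating([1000, 10, 1000]))  # one bottleneck generation
```

With ten males and a very large number of females the first function returns a value just under 40, and a single generation of 10 among generations of 1000 pulls the second estimate to about 29.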

The conditions of random sampling of gametes will seldom be closely ap- 
proached. The number of surviving offspring left by different parents may 
vary tremendously either through selection or merely accidental causes, 
a condition which tends to reduce the effective N far below the actual 
number of parents or even of grandparents. How small the effective N of 
a population may be is indicated by recent studies of SMITH and CALDER
(1927) on the Clydesdale breed of horses in Scotland, in which they find
a steady increase in the degree of inbreeding (coefficient F) equivalent to
that in a population headed by only about a dozen stallions. Even more
striking is the rapid increase in the coefficient of inbreeding in the early
history of the Shorthorn breed of cattle (McPHEE and WRIGHT 1925).


THE DISTRIBUTION OF GENE FREQUENCIES 
AND ITS IMMEDIATE CONSEQUENCES 


No mutation, migration or selection 


On making a cross between two homozygous strains a population is pro- 
duced in which the members of each pair of allelomorphs in which the 
strains differ are necessarily equally numerous. The proportional fre- 
quency of each allelomorph in unfixed series is q=0.50. In an indefinitely 
large population, there should be no change in this frequency in later 
generations (except by recurrent mutation or selection). In any finite 
population, however, some genes will come to be more frequent than their 
allelomorphs merely by chance. This means a decrease in heterozygosis, 
since the proportion of heterozygosis under random mating is 2q(1—q), 
and this quantity is maximum when q=0.50. As time goes on, divergences 
in the frequencies of factors may be expected to increase more and more 
until at last some are either completely fixed or completely lost from the 
population. The distribution curve of gene frequencies should, however, 
approach a definite form if the genes which have been wholly fixed or 
lost are left out of consideration. This can easily be seen by considering a 
case opposite in a sense to that considered above. Suppose that a large num- 
ber of different mutations occur in a previously pure line. The frequency 
ratio of mutant to type allelomorphs is initially (1/2N):(2N—1)/2N 
where N is the number of individuals. The great majority of such mutations
will be lost, by the chances of sampling, as FISHER (1922) points
out. Those which persist are largely those for which there has been a
chance increase in frequency. The distribution curve of frequencies of
persisting mutations will thus continually spread toward higher frequencies.
There must be a position of equilibrium as far as form is concerned
between this situation and that first considered, although with a uniform
decline in absolute numbers.

As noted above, decrease in heterozygosis takes place in the early genera- 
tions following a cross without any appreciable fixation or loss of genes. 
But after equilibrium has been reached in the form of the distribution 
curve, further loss in heterozygosis must be identical in rate with fixation 
plus loss. 

In simple cases, the equilibrium distribution of gene frequencies can 
easily be worked out directly. Under brother-sister mating, for example, 
the following relative frequencies of the 4 possible types of mating involv- 
ing unfixed factors are in equilibrium although the absolute frequencies 
of all are falling off 19.1 percent (= (1/4)(3 - √5)) each generation as new
genes enter the fixed states, AA × AA or aa × aa.


Mating        Relative frequency      Percent
AA × Aa           7 -  3√5              29.2
Aa × Aa         -22 + 10√5              36.1
AA × aa           9 -  4√5               5.6
Aa × aa           7 -  3√5              29.2
Total                                  100.1


Similarly in populations of 2 and 3 monoecious individuals with random 
union of gametes, the following relative frequencies are in equilibrium 
although the absolute frequencies are decreasing in each generation by 
exactly 25 percent and 16 2/3 percent respectively, verifying the 1/2N of
theory.


Case of 2 monoecious individuals per generation

Gene frequency      Class frequency (percent)
3A:1a                       32
2A:2a                       36
1A:3a                       32

Case of 3 monoecious individuals per generation

Gene frequency      Class frequency (percent)
5A:1a                       18.3
4A:2a                       21.0
3A:3a                       21.4
2A:4a                       21.0
1A:5a                       18.3
Total                      100.0
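
The two small cases above can be checked numerically. The following is a sketch only, assuming that each generation is a binomial sample of 2N gametes from the parental gene frequency and that fixed and lost classes are discarded before renormalizing; the function name and the iteration count are illustrative.

```python
from math import comb


def unfixed_equilibrium(n_individuals, iterations=500):
    """Equilibrium form of the unfixed classes and the rate of decay per generation.

    Classes are indexed by the number of A genes, 1 .. 2N-1, among 2N genes.
    """
    two_n = 2 * n_individuals
    classes = range(1, two_n)
    freqs = [1.0 / (two_n - 1)] * (two_n - 1)
    decay = 0.0
    for _ in range(iterations):
        new = []
        for j in classes:
            total = 0.0
            for i, f in zip(classes, freqs):
                p = i / two_n
                # Binomial chance that a class with i genes yields j genes.
                total += f * comb(two_n, j) * p**j * (1.0 - p) ** (two_n - j)
            new.append(total)
        surviving = sum(new)          # portion still unfixed after sampling
        decay = 1.0 - surviving       # portion newly fixed or lost
        freqs = [x / surviving for x in new]
    return freqs, decay


if __name__ == "__main__":
    for n in (2, 3):
        form, rate = unfixed_equilibrium(n)
        print(n, [round(100 * f, 1) for f in form], rate)
```

Repeated sampling and renormalizing is simply power iteration on the unfixed classes, so the returned rate should agree with 1/2N and the returned form with the tabulated percentages.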
In order to determine generally the distribution of gene frequencies,
consider the way in which genes (A) with frequency q are distributed after
one generation of random mating. In a population of N breeding individuals,
each of the specified genes will have 2Nq representatives among the
zygotes and their allelomorphs 2N(1-q). A random sample of the same
size will be distributed according to the expression [(1-q)a + qA]^(2N).
The contribution of this sample to the frequency class with an allelomorphic
ratio of q_1:(1-q_1) will be in proportion to the 2Nq_1'th term of the
above expression and to the number of genes included in the contributing
class (f). The sum of contributions from all such classes should give
the 2Nq_1'th term an absolute frequency smaller than its value in the preceding
generation (f_1) by the amount 1/(2N+1) deduced above. Following
is the equation to be solved for f as a function of q.

\[
f_1\left(1-\frac{1}{2N+1}\right)=\frac{(2N)!}{(2Nq_1)!\,[2N(1-q_1)]!}\sum q^{2Nq_1}(1-q)^{2N(1-q_1)}\,f .
\]


Replacing summation by integration and letting f = φ(q)/2N = φ(q)dq
we have⁴

\[
\frac{\phi(q_1)}{2N+1}=\frac{(2N)!}{(2Nq_1)!\,[2N(1-q_1)]!}\int_0^1 q^{2Nq_1}(1-q)^{2N(1-q_1)}\,\phi(q)\,dq .
\]
0 





The cases of 2 and 3 monoecious individuals as worked out by simple
algebra suggest an approach to a uniform distribution. As a trial let φ(q) = C.
It will be found that this makes the right and left members of the
equation identical and is thus a solution.

\[
\frac{C}{2N+1}=\frac{C\,(2N)!}{(2Nq_1)!\,[2N(1-q_1)]!}\cdot\frac{\Gamma(2Nq_1+1)\,\Gamma(2N-2Nq_1+1)}{\Gamma(2N+2)}=\frac{C}{2N+1}
\]
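
The identity for the trial solution can be confirmed in exact arithmetic. The sketch below takes φ(q) = 1 and uses Γ(m+1) = m! for whole numbers of genes; the function name is illustrative.

```python
from fractions import Fraction
from math import factorial


def uniform_solution_rhs(two_n, k):
    """Right member of the equation for phi(q) = 1 and the class with k of 2N genes.

    The beta integral of q**k * (1-q)**(2N-k) equals k! (2N-k)! / (2N+1)!.
    """
    coefficient = Fraction(factorial(two_n), factorial(k) * factorial(two_n - k))
    beta_integral = Fraction(factorial(k) * factorial(two_n - k), factorial(two_n + 1))
    return coefficient * beta_integral


if __name__ == "__main__":
    two_n = 20
    print({uniform_solution_rhs(two_n, k) for k in range(1, two_n)})
```

Every class gives the same value, 1/(2N+1), which is the left member for φ(q) = 1.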

















The case of loss at rate 1/2N should not differ appreciably from that at 
rate 1/(2N+1). It would appear that after a cross, the gene frequencies 
will spread out from 50 percent toward fixation and loss until a practical- 
ly uniform distribution is reached. The frequencies of all classes will then 
slump at a rate of about 1/2N as 1/4N of the genes become fixed and the 
same number lost per generation. Figure 3 is intended to illustrate this 
situation. 


4 f must be equated to φ(q)/2N here, rather than φ(q)/(2N-1), if the convenient limits 0
and 1 are to be used for integration in place of the limits 1/2N and (2N-1)/2N of the summation
with its 2N-1 terms.
Before finally accepting this solution, however, it will be well to examine
the terminal conditions. The amount of fixation at the extremes if N
is large can be found directly from the Poisson series, according to which
the chance of drawing 0 where m is the mean number in a sample is e^(-m).
The contribution to the 0 class will thus be

\[
\left(e^{-1}+e^{-2}+e^{-3}+\cdots\right)f=\frac{f}{e-1}=0.58\,f .
\]

This is a little larger than the ½f deduced above and indicates a
small amount of distortion near the ends due to the element of approximation
involved in substituting integration for summation. The nature
and amount of this distortion are indicated by the exact distributions obtained
in the extreme cases of only 2 and 3 monoecious individuals.

FIGURE 3. Distribution of gene frequencies in an isolated population in which fixation and
loss of genes each is proceeding at the rate 1/4N in the absence of appreciable selection or mutation.
y = L_0 e^(-T/2N). (Horizontal axis: factor frequency.)
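
The Poisson figure can be reproduced directly. The sketch assumes a uniform distribution, so that every class with m = 1, 2, 3, ... representatives has the same frequency f and contributes e^(-m) f to the 0 class; the function name and the truncation point are illustrative.

```python
import math


def poisson_zero_class_multiplier(max_count=200):
    """Sum of exp(-m) over m = 1 .. max_count, in units of the class frequency f."""
    return sum(math.exp(-m) for m in range(1, max_count + 1))


if __name__ == "__main__":
    print(round(poisson_zero_class_multiplier(), 2))
```

The sum is a geometric series with limit 1/(e-1), which rounds to the 0.58 quoted above.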

Letting L_0 be the initial number of unfixed loci (pairs of allelomorphs)
and T the number of generations we have approximately, for the unfixed loci
in the T'th generation,

\[
L_T = L_0\,e^{-T/2N} .
\]

An analogous formula holds for genes in multiple series, but in this case, 
as previously noted, the rate of fixation of loci is only half that given above. 
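
As a small illustration of the exponential formula, the sketch below compares it with the discrete form in which a proportion 1/2N of the remaining unfixed loci is removed in each generation. The function names are illustrative, and the half-life is simply a derived quantity, 2N log 2.

```python
import math


def unfixed_loci(initial_loci, n, generations):
    """Continuous approximation L_T = L_0 * exp(-T / 2N)."""
    return initial_loci * math.exp(-generations / (2.0 * n))


def unfixed_loci_discrete(initial_loci, n, generations):
    """Discrete form: a proportion 1/2N of the unfixed loci is removed each generation."""
    return initial_loci * (1.0 - 1.0 / (2.0 * n)) ** generations


def half_life(n):
    """Generations needed to halve the number of unfixed loci."""
    return 2.0 * n * math.log(2.0)


if __name__ == "__main__":
    print(unfixed_loci(1000, 100, 200), unfixed_loci_discrete(1000, 100, 200))
    print(half_life(100))
```

For N = 100 the two forms differ by well under one percent after 200 generations, and half of the unfixed loci remain after about 139 generations.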

The amount of genetic variation with respect to cumulative characters
is easily calculated assuming for simplicity pairs of allelomorphs. The contribution
of each factor pair to variance, in the case of no dominance, is
2a²q(1-q) where a is the average difference in effect between plus and
minus allelomorphs. The general formula for variance in this or any other
distribution is thus σ² = 2a²∫q(1-q)φ(q)dq. In the present case in which
φ(q) = L this reduces to σ² = (1/3)La². PEARSON's β₂ comes out with a value of
2.8, a slightly platykurtic distribution. Since the percentage of heterozygosis
for a given factor frequency, q, is 2q(1-q), the formula for heterozygosis
is the same as that for variance except that a² is to be omitted.

Similarly in the case of dominance, the contribution of a single factor
pair to variance is 4a²(1-q)²(2q-q²) where a is half the average difference
in effect between dominant and recessive zygotes. The total variance
with perfect dominance is thus in general

\[
\sigma^2 = 4a^2\int_0^1 (1-q)^2(2q-q^2)\,\phi(q)\,dq .
\]

In the case of a uniform distribution this gives σ² = (8/15)La².

FISHER (1918) has emphasized the importance of a characteristic of the
population which he calls the dominance ratio. He analyzes the variance
of characters into three portions: that due to genetic segregation (τ²),
that due to dominance, as something which causes deviations of the phenotype
from the closest possible linear relation with the genotype (ε²), and
that due to environment. Assuming environment constant, σ² = τ² + ε².
The simple formulae for the correlations between relatives, to be found if
there is random mating and no dominance, must be modified, if dominance
is present, depending on the value of the dominance ratio defined as ε²/σ².
Following are examples which he gives:

                                   Correlation
                          No dominance      Dominance
Parent and offspring          1/2           (1/2)(1 - ε²/σ²)
Brothers                      1/2           (1/2)(1 - ε²/2σ²)
Uncle and nephew              1/4           (1/4)(1 - ε²/σ²)
Double first cousins          1/4           (1/4)(1 - 3ε²/4σ²)

5 The author wishes here to correct an error in his 1921 paper which was written without knowledge
of FISHER's results cited above. In this paper it was assumed that the correlation with no
dominance needed merely to be multiplied by the squared correlation between genotype and phenotype,
the same as FISHER's τ²/σ² = (1 - ε²/σ²), to obtain that with dominance. This gives correct
results (if there is no assortative mating) in the case of offspring with parents, all other ancestors
and also in the case of collaterals where one of the individuals is related to the other through only
one parent, but it is more or less in error in other cases, the fraternal correlation being that most
affected. The reasoning followed was not exact because a correlation in the deviations due to
dominance in the cases indicated was overlooked.


FISHER has shown that the contribution of a single factor to ε², if there
is complete dominance, may be written δ² = 4q²(1-q)²a² where q is the frequency
for dominant allelomorphs. Whether a particular dominant gene
has a plus or minus effect on the character under consideration is immaterial.
The contribution due to genetic segregation he gives as β² = 8q(1-q)³a²,
thus

\[
\frac{\epsilon^2}{\sigma^2}=\frac{\sum\delta^2}{\sum(\beta^2+\delta^2)}
=\frac{\int_0^1 q^2(1-q)^2\,\phi(q)\,dq}{2\int_0^1 q(1-q)^3\,\phi(q)\,dq+\int_0^1 q^2(1-q)^2\,\phi(q)\,dq} .
\]

In the present case this reduces to 1/4 as given by FISHER, who also obtains
a uniform distribution of factor frequencies for the case of no mutation
or selection, although a different rate of decay.
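
The three results for the uniform distribution (variance without dominance, variance with dominance, and the dominance ratio) follow from elementary beta integrals and can be checked exactly. In this sketch the results are expressed as coefficients of La², and the helper name is illustrative.

```python
from fractions import Fraction
from math import factorial


def beta_integral(a, b):
    """Integral of q**a * (1-q)**b over 0..1 for non-negative integers a, b."""
    return Fraction(factorial(a) * factorial(b), factorial(a + b + 1))


# Uniform distribution phi(q) = L; variances are coefficients of L * a**2.
variance_no_dominance = 2 * beta_integral(1, 1)

# (1-q)^2 (2q - q^2) = 2 q (1-q)^2 - q^2 (1-q)^2
variance_dominance = 4 * (2 * beta_integral(1, 2) - beta_integral(2, 2))

# Dominance ratio: sum of delta^2 over sum of (beta^2 + delta^2); 4a^2 cancels.
dominance_part = beta_integral(2, 2)
segregation_part = 2 * beta_integral(1, 3)
dominance_ratio = dominance_part / (segregation_part + dominance_part)

if __name__ == "__main__":
    print(variance_no_dominance, variance_dominance, dominance_ratio)
```

The three values come out 1/3, 8/15 and 1/4, in agreement with the text.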


Nonrecurrent mutation 


If mutation is occurring, however low the rate, the decline in heterozy- 
gosis, following isolation of a relatively small group from a large popula- 
tion, cannot go on indefinitely. There will come a time when the chance 
elimination of genes will be exactly balanced by new genes arising by mu- 
tation. The equation to be solved is obviously as follows: 


\[
\frac{\phi(q_1)}{2N}=\frac{(2N)!}{(2Nq_1)!\,[2N(1-q_1)]!}\int_0^1 q^{2Nq_1}(1-q)^{2N(1-q_1)}\,\phi(q)\,dq .
\]








It may be found by trial that the expression φ(q) = C_1 q^(-1) + C_2 (1-q)^(-1)
is a solution. The terminal condition, reduction of the class of fixed genes
(q = 1) by an occasional mutation (contributing to the class q = (2N-1)/2N),
necessarily involves the appearance of new genes (contributing to the
class q = 1/2N) and therefore means that only the symmetrical solution
φ(q) = Cq^(-1)(1-q)^(-1) can be accepted as descriptive of the distribution of
the entire array of genes at equilibrium (under the rather arbitrary postulated
conditions: no selection, no migration, no recurrence of the same mutations).
Letting f = (C/2N)q^(-1)(1-q)^(-1) and making Σf = 1,

\[
C=\frac{1}{2\,[0.577+\log(2N-1)]}
\]

or approximately C = 1/(2 log 3.6N) (compare figure 5).
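
The value of C can be compared with its approximation by summing the 2N-1 unfixed classes directly. This is a sketch under the stated condition Σf = 1, with natural logarithms; the function names are illustrative.

```python
import math


def c_exact(n):
    """C from the condition sum(f) = 1 with f = (C/2N) / (q (1-q)), q = k/2N."""
    two_n = 2 * n
    total = sum(1.0 / ((k / two_n) * (1.0 - k / two_n)) for k in range(1, two_n))
    return two_n / total


def c_approx(n):
    """Approximation C = 1 / (2 log 3.6N)."""
    return 1.0 / (2.0 * math.log(3.6 * n))


if __name__ == "__main__":
    for n in (3, 100, 1000):
        print(n, c_exact(n), c_approx(n))
```

The two values differ by a few percent for N = 3 but by only about a tenth of one percent for N = 1000.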

Before attainment of equilibrium with respect to heterozygosis the
distribution will pass through phases of approximately the form φ(q) =
C_1 q^(-1)(1-q)^(-1) + C_3 in which the term C_1 gradually displaces C_3 as the number
of temporarily fixed genes approaches equilibrium with mutation.

Each particular gene has a probability distribution for the future which 
spreads in time from the initial frequency in curves which are at first ap- 
proximately normal in form but later (if the initial q was not too close to 1) 
become flat, the chances of complete fixation or complete loss each in- 
creasing by 1/4N each generation. As the chances of complete fixation 
increase, the chance of mutation must be taken into account. The distribution
passes through phases of the type C_2(1-q)^(-1) + C_3, C_2 gradually
displacing C_3, relatively, but itself beginning to decline as the chance of
complete loss increases. With initial q equal (or close) to 1, equilibrium
with mutation, and hence the hyperbolic distribution, is reached directly.
The ultimate result in any case is complete loss of the gene in question
(still assuming no recurrence of the same mutation and hence mutation of
the gene but not to it). If there is reverse mutation, but at very low rate,
a term C_1 q^(-1) must be added to the formula, and an equilibrium will be
reached in the form Cq^(-1)(1-q)^(-1). This last formula means that in the
long run (assuming no disturbances from selection, migration, etc.) the
gene will usually be found either completely fixed or completely absent
from the population (with frequencies proportional to the mutation rates
to and from the gene respectively) but that occasionally fixation or absence
will not be quite complete and that at extremely rare intervals the
gene will drift from one state to the other.

The turnover among genes in equilibrium in the distribution Cq^(-1)(1-q)^(-1)
can be determined from consideration of the variance of q, and independently
by application of the Poisson law.

Let σ_q² = Σ(q - ½)²f/Σf be the variance of q, excluding the terminal classes,
the summation including 2N-1 terms. This variance is increased in
the following generation by the spreading out of each frequency class as a
result of random sampling. The variance from the spreading of a single
class is q(1-q)/2N and the average is thus

\[
\Delta\sigma_q^2=\frac{\sum q(1-q)f}{2N\sum f}=\frac{1}{2N}\left(\frac14-\sigma_q^2\right).
\]

The sum σ_q² + Δσ_q² includes the newly fixed factors whose contribution
is ¼K where K is the rate of fixation, plus loss, but excludes mutation.

Digressing for a moment to the case of no mutation but equilibrium of
form, we have at once

\[
\sigma_q^2+\Delta\sigma_q^2=\tfrac14 K+(1-K)\,\sigma_q^2,
\qquad
\left(K-\frac{1}{2N}\right)\left(\sigma_q^2-\frac14\right)=0,
\]

giving an independent demonstration that the rate of decay is 1/2N in this case.

Returning to the case of equilibrium under mutation, the contribution
of new mutations to variance is K(N-1)²/(2N)².

\[
\sigma_q^2+\Delta\sigma_q^2-\tfrac14 K+K\left(\frac{N-1}{2N}\right)^2=\sigma_q^2,
\qquad
K=C\approx\frac{1}{2\log 3.6N}.
\]


The proportion exchanged at each extreme is thus about half the corresponding
subterminal class where N is large (f_1 = f_(2N-1) = 2NC/(2N-1)) by
this method. This compares fairly well with the proportion as determined
by the Poisson law, which is 0.46 times the subterminal class instead of
0.50.
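
The Poisson figure of 0.46 can be recovered under two assumptions that are made explicit here: that near the end of the range the class with k genes has frequency f_1/k (the hyperbolic form for large N), and that such a class leaves no representative with chance e^(-k). The function name is illustrative.

```python
import math


def poisson_exchange_ratio(max_count=200):
    """Loss into the zero class, in units of the subterminal class f_1.

    Assumes f_k = f_1 / k near the end of the range and a Poisson chance
    exp(-k) that a class with k genes leaves no representative.
    """
    return sum(math.exp(-k) / k for k in range(1, max_count + 1))


if __name__ == "__main__":
    print(round(poisson_exchange_ratio(), 2))
```

The series sums to -log(1 - 1/e), a little under the value of one half obtained from the variance argument.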

The equilibrium frequencies can be worked out algebraically in simple
cases. The figures below give the results in the case of a population of 3
monoecious individuals for comparison with the theoretical values deduced
above. The rate of exchange at each extreme is actually 10.8
percent in comparison with 11.0 percent as ½[(2N-1)/2N] times the subterminal
class, or 11.4 percent from the formula 1/[4(0.577 + log 5)]. The case of
irreversible mutation is also given.

                     Reversible mutation               Irreversible mutation
Gene frequency       Exact          Cq^(-1)(1-q)^(-1)    Exact       C(1-q)^(-1)
                     equilibrium
5A:1a                27.5           26.3                 47.7        43.8
4A:2a                15.4           16.4                 20.6        21.9
3A:3a                14.1           14.6                 14.1        14.6
2A:4a                15.4           16.4                 10.2        10.9
1A:5a                27.5           26.3                  7.3         8.8
Totals               99.9          100.0                 99.9       100.0
Terminal exchange    10.8           11.0                 18.0        18.25
Loss                                                      3.6         3.65
The number of unfixed loci (L) which a given mutation rate per individual
(μ) will support in a population is easily found, assuming only pairs
of allelomorphs. The number of mutations is KL as well as Nμ. Therefore
L = Nμ/K = 2Nμ log 3.6N. The variance of cumulatively determined
characters worked out as in the preceding case comes out 2Nμa² in the case
of no dominance and (10/3)Nμa² in the case of dominance, in both cases
directly proportional to the size of population⁶ and to the mutation rate.
In view of the piling up of new mutations, one might perhaps expect to
find a leptokurtic distribution for characters. This, however, turns out not
to be the case: PEARSON's β₂ comes out exactly 3 in the case of no dominance
on substitution in the general formula


\[
\beta_2 = 3+\frac{\int_0^1 q(1-q)\,[1-6q(1-q)]\,\phi(q)\,dq}{\int_0^1 q(1-q)\,\phi(q)\,dq}
\]


FISHER’s dominance ratio comes out 1/5 in this case. 
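
The figures for the hyperbolic distribution can be checked in the same manner as those for the uniform case. The sketch assumes φ(q) = LC/[q(1-q)] with LC = Nμ, so that dividing each integrand by q(1-q) lowers both exponents by one; variances are expressed as coefficients of Nμa². The helper names are illustrative, and natural logarithms are used.

```python
from fractions import Fraction
from math import factorial, log


def beta_integral(a, b):
    """Integral of q**a * (1-q)**b over 0..1 for non-negative integers a, b."""
    return Fraction(factorial(a) * factorial(b), factorial(a + b + 1))


# No dominance: 2 * integral of q(1-q) / (q(1-q)).
no_dominance_coeff = 2 * beta_integral(0, 0)

# Dominance: (1-q)^2 (2q - q^2) / (q(1-q)) = 2(1-q) - q(1-q).
dominance_coeff = 4 * (2 * beta_integral(0, 1) - beta_integral(1, 1))

# Dominance ratio: q^2(1-q)^2 and 2q(1-q)^3, each divided by q(1-q).
dominance_ratio = beta_integral(1, 1) / (2 * beta_integral(0, 2) + beta_integral(1, 1))

# Numerator of the correction to beta_2: integral of [1 - 6q(1-q)].
kurtosis_numerator = beta_integral(0, 0) - 6 * beta_integral(1, 1)


def unfixed_loci(n, mu):
    """L = 2 N mu log(3.6 N) for mutation rate mu per individual."""
    return 2.0 * n * mu * log(3.6 * n)


if __name__ == "__main__":
    print(no_dominance_coeff, dominance_coeff, dominance_ratio, kurtosis_numerator)
    print(unfixed_loci(10**6, 1e-3))
```

The coefficients are 2 and 10/3, the dominance ratio is 1/5, and the numerator of the correction to β₂ vanishes, so that β₂ = 3 whatever the denominator.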

The preceding results differ somewhat from those presented by FISHER
(1922). The latter's analysis was based on a transformation of the scale of
factor frequencies designed to make the variance due to random sampling
uniform at all points. The variance at a given value of q is q(1-q)/2N.
FISHER assumes that if the ratio of small differences on the q scale to the
corresponding differences on a new θ scale be made proportional to the
varying standard deviation of q, the standard deviation on the θ scale will
be uniform. Letting dq/dθ = √(q(1-q)) leads to the transformation θ =
cos^(-1)(1-2q) with uniform variance of factor frequencies of 1/2N. Letting
y = F(θ) be the distribution of factor frequencies in one generation,
he wrote that in the next as

\[
y+\Delta y=\int_{-\infty}^{\infty}\frac{1}{\sigma\sqrt{2\pi}}\,e^{-\delta\theta^2/2\sigma^2}
\left(y+y'\,\delta\theta+\frac{\delta\theta^2}{2!}\,y''+\cdots\right)d\,\delta\theta
\]

and measuring time in generations (T) he reached the equation

\[
\frac{\partial y}{\partial T}=\frac{1}{4N}\,\frac{\partial^2 y}{\partial\theta^2}.
\]

After noting that the solution for the symmetrical stationary case is
y = L/π, he proceeded to derive the formulae for increasing and decreasing
y. Considering the latter, dy/dT = -Ky where K is the rate of decay,
giving (1/4N) d²y/dθ² = -Ky as the equation to be solved. In the symmetrical
case this yields y = C cos[√(4NK)(θ - π/2)] where C = √(4NK)/[2 sin(½π√(4NK))]
in order to give a total frequency of unity and is to be multiplied
further by L_0 e^(-KT) to show change from the initial frequency of L_0.

6 These estimates of number of unfixed loci and of variance depend, of course, on the validity
of the conditions on which the formula of the distribution curve is based. How far the mutation
rate per locus can be considered negligibly small as size of population increases is discussed later.

The maximum value which K can take without giving negative frequencies
within the range is obviously 1/4N and FISHER found reason for accepting
this as the value in the case of no mutation. The formula for the
distribution in this case reduces to y = ½ sin θ. FISHER transformed these
equations to the scale Z = log[q/(1-q)] in which the case of no mutation
becomes y = ¼ sech² ½Z and the case of loss balanced by mutation becomes
y = (1/2π) sech ½Z. This transformation brings the curves into an approach
to the form of the normal probability curve. For our present purpose it is
preferable to transform to the scale of actual factor frequencies. The case
of steady decay becomes y = 1 with which my results are in agreement, although
in disagreement as to rate of decay. In the case of loss balanced by
mutation, FISHER's formula transforms into y = 1/[π√(q(1-q))] instead of
1/[2(log 3.6N)q(1-q)] as developed in the present paper. FISHER obtained
√(π/2) N^(3/2) μ for the number of unfixed factors, in contrast with 2Nμ log 3.6N;
and √(2/πN) for the factor turnover, in contrast with 1/[2 log 3.6N].

It will be seen that FISHER's solution leads to a smaller number of unfixed
factors with more rapid turnover in very small populations (less than 81)
but to a larger number of such factors with slower turnover in larger populations.
In a breeding population of one million with one mutation per
1000 individuals, FISHER's formula gives 1,250,000 unfixed factors with
a turnover of 0.08 percent while I get 30,000 unfixed factors with a turnover
of 3.3 percent.
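
The numerical comparison can be reproduced from the four expressions just given. This is a sketch only: the function names are illustrative, μ is taken as the mutation rate per individual, and natural logarithms are assumed.

```python
import math


def unfixed_factors_fisher(n, mu):
    """sqrt(pi/2) * N**(3/2) * mu, the expression attributed to FISHER (1922)."""
    return math.sqrt(math.pi / 2.0) * n**1.5 * mu


def unfixed_factors_wright(n, mu):
    """2 N mu log(3.6 N), the expression of the present paper."""
    return 2.0 * n * mu * math.log(3.6 * n)


def turnover_fisher(n):
    return math.sqrt(2.0 / (math.pi * n))


def turnover_wright(n):
    return 1.0 / (2.0 * math.log(3.6 * n))


if __name__ == "__main__":
    n, mu = 10**6, 1e-3
    print(unfixed_factors_fisher(n, mu), turnover_fisher(n))
    print(unfixed_factors_wright(n, mu), turnover_wright(n))
```

For a million breeding individuals and one mutation per 1000 individuals, the first pair comes to about 1,250,000 factors with a turnover of 0.08 percent and the second to about 30,000 factors with a turnover of 3.3 percent. The two counts cross at a population size of the order of 80.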

The exact harmonizing of the results of the two methods of attack has
been a somewhat puzzling matter, but Doctor FISHER, on examination of
the manuscript of the present paper, has written to me the following which
I quote at his suggestion. “.... I have now fully convinced myself that
your solution is the right one. It may be of some interest that my original
error lay in the differential equation

\[
\frac{\partial y}{\partial T}=\frac{1}{4N}\,\frac{\partial^2 y}{\partial\theta^2}
\]

which ought to have been

\[
\frac{\partial y}{\partial T}=\frac{1}{4N}\,\frac{\partial}{\partial\theta}\left(y\cot\theta\right)+\frac{1}{4N}\,\frac{\partial^2 y}{\partial\theta^2},
\]

the new term coming in from the fact that the mean value of δp in any generation
from a group of factors with gene frequency p is exactly zero,⁷ and
consequently the mean value of δθ is not exactly zero but involves a minute
term -(1/4N) cot θ. With this correction, I find myself in entire
agreement with your value 2N for the time of relaxation and with your
corrected distribution for factors in the absence of selection.”
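
The effect of the added term can be seen numerically. The sketch below assumes the two differential equations in the forms (1/4N) ∂²y/∂θ² and (1/4N) ∂(y cot θ)/∂θ + (1/4N) ∂²y/∂θ², and applies each, by central differences, to y = sin θ, which corresponds to a uniform distribution on the scale of q. The function names and the step size are illustrative.

```python
import math


def second_derivative(y, theta, h=1e-4):
    return (y(theta + h) - 2.0 * y(theta) + y(theta - h)) / (h * h)


def original_rhs(y, theta, n):
    """(1/4N) d2y/dtheta2, the equation as first written."""
    return second_derivative(y, theta) / (4.0 * n)


def corrected_rhs(y, theta, n, h=1e-4):
    """(1/4N) d/dtheta (y cot theta) + (1/4N) d2y/dtheta2."""
    def drift_term(t):
        return y(t) / math.tan(t)

    d_drift = (drift_term(theta + h) - drift_term(theta - h)) / (2.0 * h)
    return (d_drift + second_derivative(y, theta)) / (4.0 * n)


if __name__ == "__main__":
    n, theta = 10, 1.0
    print(-original_rhs(math.sin, theta, n) / math.sin(theta))   # implied rate of decay
    print(-corrected_rhs(math.sin, theta, n) / math.sin(theta))
```

For y = sin θ the first equation implies a rate of decay of 1/4N and the corrected one a rate of 1/2N, that is, a time of relaxation of 2N generations.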


Reversible recurrent mutation 


It only requires a very moderate mutation rate in a large population 
for the number of unfixed loci to become enormous. This raises the ques- 
tion as to the effect of a limitation in the number of mutable loci, and re- 
currence of mutations. 

Consider now the case of genes with uniform rates of recurrence of mu- 
tation and reverse mutation. Let u be the rate per generation for break 
down of the gene A and v that for origin from allelomorphs. A class of 
genes with frequency q (that of all allelomorphs, 1 —q) will be distributed in 
the following generation under random sampling according to the expan- 
sion of the expression 


{[(1-q) - v(1-q) + uq]a + [q + v(1-q) - uq]A}^(2N) f.


Equating the total contribution to a given class, to the frequency of 
this class in the parent generation, reduced by the proportion K, if there 
is a uniform rate of decay, gives as the equation to be solved: 


\[
\frac{1-K}{2N}\,\phi(q_1)=\frac{(2N)!}{(2Nq_1)!\,[2N(1-q_1)]!}\int_0^1\left[q(1-u-v)+v\right]^{2Nq_1}\left[1-q(1-u-v)-v\right]^{2N(1-q_1)}\phi(q)\,dq .
\]


It will be found by trial that the right and left members become identical
in certain cases in which φ(q) is of the form q^s(1-q)^t.
Let x = q(1-u-v) + v. Then

\[
q^s(1-q)^t\,dq=\frac{\left[x^s-s\,v\,x^{s-1}\cdots\right]}{(1-u-v)^s}\cdot\frac{\left[(1-x)^t-t\,u\,(1-x)^{t-1}\cdots\right]}{(1-u-v)^t}\cdot\frac{dx}{1-u-v}.
\]


7 p is the q of the present paper. Since the above was written, FISHER has published this revision
of his results in The genetical theory of natural selection, 1930.
The small amount of spread from a given class will justify retention of
the untransformed limits of integration.

Noting that Γ(c+s+1) = c! c^s [1 + s(1+s)/2c] approximately when
c is an integer and s is small compared with c, and making use of the following
derived relation

\[
\frac{2N\,(2N)!}{(2Nq_1)!\,[2N(1-q_1)]!}\int_0^1 x^{2Nq_1+s}(1-x)^{2N(1-q_1)+t}\,dx
=\frac{4N+s(s+1)\,q_1^{-1}+t(t+1)(1-q_1)^{-1}}{4N+(s+t+1)(s+t+2)}\;q_1^{s}(1-q_1)^{t}
\]

the equation may be written as follows for small values of s and t (compared
with N) and values of u and v of a still lower order of size.

\[
\begin{aligned}
(1-K)\,q_1^{s}(1-q_1)^{t}=\frac{1}{(1-u-v)^{s+t+1}}\Bigg[&\frac{4N}{4N+(s+t+1)(s+t+2)}\;q_1^{s}(1-q_1)^{t}\\
&+\left(\frac{s(s+1)}{4N+(s+t+1)(s+t+2)}-\frac{4Nsv}{4N+(s+t)(s+t+1)}\right)q_1^{s-1}(1-q_1)^{t}\\
&+\left(\frac{t(t+1)}{4N+(s+t+1)(s+t+2)}-\frac{4Ntu}{4N+(s+t)(s+t+1)}\right)q_1^{s}(1-q_1)^{t-1}\Bigg]
\end{aligned}
\]


The coefficients of q_1^(s-1)(1-q_1)^t and of q_1^s(1-q_1)^(t-1) must equal 0 either
under complete equilibrium or equilibrium merely in form of distribution.
Neglecting small quantities:

    s = 0  or  s = 4Nv - 1
    t = 0  or  t = 4Nu - 1.

In the case of complete equilibrium (K = 0), it turns out that the coefficients
of q_1^s(1-q_1)^t in the left and right members are also satisfied to a
first approximation by s = 4Nv-1, t = 4Nu-1. They are also satisfied by
letting s = 0, t = 0 provided that u+v = 1/2N. The relation between the
fixed terminal and the unfixed subterminal classes, however, requires that
u = v = 1/4N in this case, which thus becomes merely a special case of the
first solution. Similarly, the solutions s = 0, t = 4Nu-1 and s = 4Nv-1,
t = 0 require that v = 1/4N and u = 1/4N respectively and thus also reduce
to special cases of the first solution. It appears then that the distribution
of gene frequencies in equilibrium under mutation and reverse mutation
may be represented approximately by curves of PEARSON's Type I,

\[
\phi(q)=\frac{\Gamma(4Nu+4Nv)}{\Gamma(4Nu)\,\Gamma(4Nv)}\;q^{4Nv-1}(1-q)^{4Nu-1}.
\]


The terminal conditions are of interest in this and other cases to be considered.
The factor turnover at each extreme may be written

\[
K_0=\tfrac12 f_1=2Nv\,f_0,\qquad K_{2N}=\tfrac12 f_{2N-1}=2Nu\,f_{2N},
\]

where the subterminal classes have the frequencies

\[
f_1=\frac{1}{2N}\,\phi\!\left(\frac{1}{2N}\right),\qquad f_{2N-1}=\frac{1}{2N}\,\phi\!\left(1-\frac{1}{2N}\right).
\]

In the present case, the terminal classes have the frequencies f_0 = C/[4Nv(2N)^(4Nv)]
and f_(2N) = C/[4Nu(2N)^(4Nu)] where C is the coefficient in the expression
for φ(q).

It will be seen that the form of the curve depends not only on the rates 
of mutation of the genes but also on the size of the breeding population. 
With small populations or rare recurrence of mutations, the distribution 
approaches the symmetrical form y = [1/(2 log 3.6N)]q^(-1)(1-q)^(-1) already discussed
(figure 5). The ratio of the class of temporarily fixed genes (f_(2N)) to
the class of complete absence (f_0) must be approximately v:u in this case
in order that the number of mutations at each extreme of the symmetrical
distribution of unfixed factors may be equal.

With increase in size of the population, the gene frequencies tend in
general to be distributed in asymmetrical U- or even I- or J-shaped
curves. For example, if the size of population reaches 1/4u and v is much
smaller than u, the distribution will be the hyperbola φ(q) = Cq^(-1) with a
piling up of factors with few or no plus representatives.








With sufficient increase in the size of population, the distribution at
length takes a form approaching that of the normal probability curve,
centered about the point q = v/(u+v) which, indeed, is always the mean

\[
\bar q=\int_0^1 q\,\phi(q)\,dq=\frac{v}{u+v}.
\]

The variance of gene frequencies, σ_q² = ∫(q - q̄)²φ(q)dq, is

\[
\sigma_q^2=\frac{\bar q\,(1-\bar q)}{4N(u+v)+1}.
\]
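
The mean and variance of the Type I curve can be checked by direct numerical integration. The sketch below uses a simple midpoint rule and is intended only for cases in which 4Nu and 4Nv both exceed 1, so that the curve is finite at both ends; the function name and the number of steps are illustrative.

```python
import math


def type_one_moments(n, u, v, steps=20000):
    """Total, mean and variance of phi(q) ~ q**(4Nv-1) * (1-q)**(4Nu-1)."""
    a, b = 4.0 * n * v, 4.0 * n * u
    log_norm = math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
    h = 1.0 / steps
    total = mean = second = 0.0
    for i in range(steps):
        q = (i + 0.5) * h  # midpoint of the i'th interval
        density = math.exp(log_norm + (a - 1.0) * math.log(q) + (b - 1.0) * math.log(1.0 - q))
        total += density * h
        mean += q * density * h
        second += q * q * density * h
    return total, mean, second - mean * mean


if __name__ == "__main__":
    n, u, v = 1000, 0.002, 0.001
    total, mean, variance = type_one_moments(n, u, v)
    q_bar = v / (u + v)
    print(total, mean, q_bar)
    print(variance, q_bar * (1.0 - q_bar) / (4.0 * n * (u + v) + 1.0))
```

With N = 1000, u = 0.002 and v = 0.001 the integrated mean agrees with v/(u+v) and the integrated variance with the formula above.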
The amount of genetic variation of cumulative characters may be calculated
as before. In the case of no dominance and paired allelomorphs
it is σ² = 2La² 4Nu q̄/(4Nu+4Nv+1) or 2La²[q̄(1-q̄) - σ_q²]. Where u is
much greater than v, this can be written approximately σ² = 2La² 4Nv/(4Nu+1),
approaching 2La² q̄ as a limit, as N increases and L comes to include
all loci.

FIGURE 4. Distribution of type genes in an isolated population in which equilibrium has
been reached with destructive mutation but has not been approached with respect to formative
mutation. y = 4NuL_0 e^(-uT)(1-q)^(4Nu-1) with 4Nu much smaller than 1 and the formula approxi-
FIGURE 5. Distribution of gene frequencies (or probability array of gene) where equilibrium
with mutation has been attained. Population so small that the terms 4Nu and 4Nv are both much
smaller than 1. y = Cq^(4Nv-1)(1-q)^(4Nu-1), approximately q^(-1)(1-q)^(-1)/(2 log 3.6N).
As the formula for this case was derived on the assumption of small values
of u and v, it is desirable to obtain an independent test of its applicability
to large values. This can be done as follows: the increase in variance
of q due to random sampling is

\[
\frac{1}{2N}\int_0^1 q(1-q)\,\phi(q)\,dq=\frac{\bar q(1-\bar q)-\sigma_q^2}{2N}.
\]

Letting Δq = -uq + v(1-q) be, as before, the change in q due to mutation,
q + Δq - q̄ = (q - q̄)(1-u-v). Thus the effect of mutation is as if all deviations
from the mean were reduced in the proportion (1-u-v). The decrease
in the variance of q due to mutation is therefore σ_q²[1 - (1-u-v)²].
At equilibrium the increase in σ_q² due to random sampling is exactly balanced
by the decrease due to mutation, yielding:

\[
\sigma_q^2=\frac{\bar q(1-\bar q)}{4N(u+v)-2N(u+v)^2+1}.
\]

The term -2N(u+v)² in the denominator is important only when (u+v)
has a large absolute value. Omitting this, the formula is identical with
that deduced by the first method and thus gives an independent demonstration
of its validity. As mutation approaches its maximum value
(u+v = 1), the variance of q approaches q̄(1-q̄)/2N, that due to random
sampling alone.
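
The balance between sampling and mutation can also be followed generation by generation. The recursion below is a sketch under an added assumption about the order of events, namely that mutation first contracts all deviations from the mean by (1-u-v) and that a binomial sample of 2N gametes is then drawn; it is not taken from the text, and the function names are illustrative.

```python
def equilibrium_variance(n, u, v, generations=5000):
    """Iterate the variance of q to equilibrium, with the mean held at v/(u+v)."""
    c = 1.0 - u - v
    mean = v / (u + v)
    var = 0.0
    for _ in range(generations):
        var_after_mutation = c * c * var
        # Sampling adds the mean of q(1-q)/2N taken over the distribution.
        sampling = (mean * (1.0 - mean) - var_after_mutation) / (2.0 * n)
        var = var_after_mutation + sampling
    return var


def formula_variance(n, u, v):
    """The expression of the text, keeping the term -2N(u+v)**2."""
    mean = v / (u + v)
    return mean * (1.0 - mean) / (4.0 * n * (u + v) - 2.0 * n * (u + v) ** 2 + 1.0)


if __name__ == "__main__":
    n, u, v = 50, 0.01, 0.02
    print(equilibrium_variance(n, u, v), formula_variance(n, u, v))
```

For N = 50, u = 0.01 and v = 0.02 the two values agree to within about one percent; the small difference reflects the assumed order of mutation and sampling.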


Migration 


The distribution of gene frequencies in an incompletely isolated subgroup 
of a large population can be obtained immediately from the preceding re- 
sults. The change in gene frequency per generation under migration 
Δq = -m(q - q_m), can be written -m(1-q_m)q + mq_m(1-q), which is in the
same form as the change of q under mutation, Δq = -uq + v(1-q). We
may write at once for the distribution under negligible mutation rates:

\[
\phi(q)=\frac{\Gamma(4Nm)}{\Gamma(4Nmq_m)\,\Gamma[4Nm(1-q_m)]}\;q^{4Nmq_m-1}(1-q)^{4Nm(1-q_m)-1}.
\]

The mutation terms 4Nu and 4Nv can be inserted, if mutation rates are
not negligible.

Figure 6 shows how the form of the distribution changes with change
in m or N. Where m is less than 1/2N there is a tendency toward chance
fixation of one or the other allelomorph. Greater migration prevents such
fixation. How little interchange would appear necessary to hold a large
population together may be seen from the consideration that m = 1/2N
means an interchange of only one individual every other generation, regardless
of the size of the subgroup. However, this estimate must be much
qualified by the consideration that the effective N of the formula is in general
much smaller than the actual size of the population or even than the
breeding stock, and by the further consideration that q_m of the formula refers
to the gene frequency of actual migrants and that a further factor
must be included if q_m is to refer to the species as a whole. Taking both
of these into account, it would appear that an interchange of the order of
thousands of individuals per generation between neighboring subgroups
of a widely distributed species might well be insufficient to prevent a considerable
random drifting apart in their genetic compositions. Of course,
differences in the condition of selection among the subgroups may greatly
increase this divergence. It appears, however, that the actual differences
among natural geographical races and subspecies are to a large extent of
the nonadaptive sort expected from random drifting apart. An interesting
example, apparently nonadaptive, is the racial distribution of the 3
allelomorphs which determine human blood groups (BERNSTEIN 1925).

FIGURE 6. Distribution of frequencies of a gene among subdivisions of a population in which
q_m = 1/2 (or probability array of gene within a subdivision) under various amounts of intermigration.
y = Cq^(4Nmq_m-1)(1-q)^(4Nm(1-q_m)-1).

The variance of distribution of values of q among subgroups (in the ideal
case) is $\sigma_q^2 = q_m(1-q_m)/(4Nm+1)$ by substitution in the formula for the
preceding case.
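
The migration distribution is a Beta curve with parameters a = 4Nm·q_m and b = 4Nm(1-q_m), so its variance can be checked against the expression just given. The following is a minimal sketch under that reading; the function names and the sample values of N and q_m are illustrative assumptions, not part of the original analysis.

```python
import math

def migration_density(q, n, m, qm):
    """phi(q) = Gamma(a+b) / (Gamma(a) Gamma(b)) * q^(a-1) * (1-q)^(b-1)."""
    a = 4 * n * m * qm
    b = 4 * n * m * (1 - qm)
    log_c = math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
    return math.exp(log_c + (a - 1) * math.log(q) + (b - 1) * math.log(1 - q))

def migration_variance(n, m, qm):
    """Variance of a Beta(a, b) variate: a*b / ((a+b)^2 (a+b+1))."""
    a = 4 * n * m * qm
    b = 4 * n * m * (1 - qm)
    return a * b / ((a + b) ** 2 * (a + b + 1))

n, qm = 500, 0.5                       # illustrative values
m = 1 / (2 * n)                        # the critical amount of migration
migrants_per_generation = n * m        # one half, i.e. one individual every other generation
flat = migration_density(0.3, n, m, qm)  # a = b = 1 here, so the curve is uniform
var = migration_variance(n, m, qm)
```

With m = 1/2N and q_m = 1/2 both exponents vanish, which is why this amount of migration marks the boundary between U-shaped and humped curves.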

The zygotic distribution $[(1-q)a + qA]^2$ cannot be expected to hold in
a population made up of isolated groups among which gene frequency varies.
WAHLUND (1928) has shown that the proportions in each homozygous
class are increased at the expense of the heterozygotes by the amount
of the variance of the gene frequencies among the subgroups,⁸ the proportions
becoming $[(1-q)^2+\sigma_q^2]\,aa + [2q(1-q)-2\sigma_q^2]\,Aa + [q^2+\sigma_q^2]\,AA$.
By substituting the expression for $\sigma_q^2$, given above, in WAHLUND's formula
one might determine empirically the effective value of 4Nm for the population,
except that it would be difficult to rule out the possibility that
some of the variance of gene frequencies might be due to differences in
the selection coefficients among the subgroups instead of merely to random
drifting apart.
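
WAHLUND's relation and the suggested empirical estimate of 4Nm can be sketched as follows. The subgroup frequencies are made-up illustrative numbers, the subgroups are assumed equal in size and each in random-mating proportions, and the estimator simply inverts the variance formula for the ideal case while ignoring the selection caveat just mentioned.

```python
def pooled_genotypes(freqs):
    """Average genotype proportions (aa, Aa, AA) over equally sized subgroups."""
    k = len(freqs)
    aa = sum((1 - q) ** 2 for q in freqs) / k
    het = sum(2 * q * (1 - q) for q in freqs) / k
    big_aa = sum(q ** 2 for q in freqs) / k
    return aa, het, big_aa

def mean_and_variance(freqs):
    k = len(freqs)
    mean = sum(freqs) / k
    var = sum((q - mean) ** 2 for q in freqs) / k
    return mean, var

def wahlund_genotypes(freqs):
    """Proportions predicted from the mean and variance of q alone."""
    mean, var = mean_and_variance(freqs)
    return ((1 - mean) ** 2 + var, 2 * mean * (1 - mean) - 2 * var, mean ** 2 + var)

def effective_4nm(freqs):
    """Invert sigma^2 = q(1-q)/(4Nm+1), taking the mean of q as q_m."""
    mean, var = mean_and_variance(freqs)
    return mean * (1 - mean) / var - 1

subgroup_q = [0.1, 0.4, 0.5, 0.8]      # hypothetical subgroup gene frequencies
direct = pooled_genotypes(subgroup_q)
predicted = wahlund_genotypes(subgroup_q)
estimate = effective_4nm(subgroup_q)
```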


Irreversible recurrent mutation 


The solution s=0, t=4Nu-1 for the equation reached in the case of
recurrent mutation satisfies the conditions for equilibrium of form under
irreversible mutation (v=0), with decay at rate K=u.

$$\phi(q) = 4Nu\,L_0e^{-uT}(1-q)^{4Nu-1}.$$

The proportional frequency of the unfixed subterminal class which is
not replenished by mutation is $f_1/(L_0e^{-uT}) = 2u$, twice the rate of decay
and thus approximately satisfying the necessary terminal condition.


For values of u as small as 1/(2N log 3.6N) the coefficient in the expression
for $\phi(q)$ must be calculated to a closer approximation,

$$\frac{4Nu\,L_0e^{-uT}}{1 - \left(\dfrac{\ldots}{2N}\right)^{4Nu}}$$

which approaches log 3.6N as u approaches zero.

The distribution of gene frequencies under irreversible mutation is il- 
lustrated in figure 4. 

This case is of most interest as representing for a long time the distribution
of type genes in a small group isolated from a large one in which
all type genes are close to fixation. The release of deleterious mutation
pressure from equilibrium with selection will result in approximate equilibrium
of the form described above. With decay at the rate u, it may
be a very long time before effects of reverse mutation become appreciable
and the final equilibrium $y = Cq^{-1}(1-q)^{-1}$ approached. Assuming that
type genes are dominant, the dominance ratio in this case is 1/3.

8 The percentage of heterozygotes is $2\int q(1-q)\phi(q)\,dq$ where $\phi(q)$ is the distribution of values
of q among the subgroups. As shown above this reduces to $2\bar q(1-\bar q) - 2\sigma_q^2$, thus demonstrating
WAHLUND's principle.


Selection⁹


Using $\Delta q = sq(1-q)$ as the measure of the effect of genic selection, the
class of genes with frequency (1-q)a : qA is distributed after one generation
according to the expression:

$$\{(1-q)(1-sq)\,a + q[1+s(1-q)]\,A\}^{2N}.$$

The distribution of gene frequencies which is in equilibrium may be obtained
from the following equation which represents the total contribution
to class $q_1$ after one generation, as equal to its previous frequency.

$$\frac{\phi(q_1)}{2N} = \frac{(2N)!}{(2Nq_1)!\,[2N(1-q_1)]!}\int_0^1 q^{2Nq_1}(1-q)^{2N(1-q_1)}[1+s(1-q)]^{2Nq_1}(1-sq)^{2N(1-q_1)}\phi(q)\,dq.$$
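
The one-generation array under selection can be made concrete with a short sketch that expands the binomial term by term. It assumes a small illustrative value of 2N so that the probabilities can be listed exactly; the function and variable names are hypothetical.

```python
import math

def next_generation_distribution(q, s, two_n):
    """Expand {(1-q)(1-sq) a + q[1+s(1-q)] A}^(2N): probability of k copies of A."""
    p_big_a = q * (1 + s * (1 - q))      # frequency of A after selection
    p_small_a = (1 - q) * (1 - s * q)    # frequency of a after selection (sums to 1 with p_big_a)
    return [math.comb(two_n, k) * p_big_a ** k * p_small_a ** (two_n - k)
            for k in range(two_n + 1)]

two_n = 40                               # illustrative number of genes (2N)
q, s = 0.3, 0.05
probs = next_generation_distribution(q, s, two_n)
total = sum(probs)
mean_q1 = sum(k * p for k, p in enumerate(probs)) / two_n
```

The mean of the new array is q + sq(1-q), the deterministic shift, while the spread about it is the binomial sampling variance that enters the integral equation.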


To a first approximation, the selection terms approach the value
$e^{2Ns(q_1-q)}$. The introduction of a factor $e^{2Nsq}$ into the previously reached
formula for $\phi(q)$ gives a solution of the equation (for very small values
of s) since it cancels the new term $e^{-2Nsq}$ in the integral, and leaves $e^{2Nsq_1}$ as
a factor in $\phi(q_1)$. This was the basis for the formula published (WRIGHT
1929a) as $\phi(q) = Ce^{2Nsq}q^{4Nv-1}(1-q)^{4Nu-1}$, intended to exhibit in combination
the effects of selection, mutation in both directions and size of population.
Further consideration reveals that this solution is the correct one
only for the case of irreversible mutation and then only when the selection
coefficient is exceedingly small, less than 1/2N in fact. FISHER (1930) in
his recently published revision of the results of his method of attack on this
problem has given a formula for a special case of selection, equilibrium of
flux from an inexhaustible supply of mutating genes. This is given as accurate
as long as $Ns^2$ is small. Assuming one mutation per generation, he
writes:

$$\frac{2\,dp\,(1-e^{-4anq})}{pq\,(1-e^{-4an})}$$

In this formula, a (= -s) is the selection coefficient, p (= 1-q) is frequency
of mutant genes and dp may be taken as 1/2N numerically. This
agrees with my previous formula for irreversible mutation, $y = Ce^{2Nsq}(1-q)^{-1}$,
only when s is less than 1/2N, above which value my formula rapidly
leads to impossible results. On reexamination of my method, however,
I find that the same degree of approximation can be reached by it.
The expansion of $[1+s(1-q)]^{2Nq_1}[1-sq]^{2N(1-q_1)}$ yields series of terms
which condense into the expression $e^{2Ns(q_1-q)}\{1 - Ns^2[q_1(1-q_1) + (q_1-q)^2]\}$,
taking into account terms in $Ns^2$, $N^2s^3$, $N^3s^4$, $N^4s^5$ as well as those in which
N and s have the same exponent. Since the random deviations of q have
a variance of $q_1(1-q_1)/2N$ the term $(q_1-q)$ is of the order $\sqrt{1/2N}$. A
second order approximation should be obtainable by retaining the term
$Ns^2q_1(1-q_1)$ while that in $Ns^2(q_1-q)^2$ may be dropped. The equation to
be solved can now be written.

9 This and the following section have been rewritten since submission of the manuscript in order
to take account of the correction of my formula, suggested by FISHER's results in The genetical
theory of natural selection, 1930 as noted herein.


$$\phi(q_1) = 2N\,\frac{(2N)!}{(2Nq_1)!\,[2N(1-q_1)]!}\,e^{2Nsq_1}[1 - Ns^2q_1(1-q_1)]\int_0^1 q^{2Nq_1}(1-q)^{2N(1-q_1)}e^{-2Nsq}\phi(q)\,dq.$$

Let $\phi(q) = e^{2Nsq}q^{-1}(1-q)^{-1}(a + bq + cq^2 + dq^3 + \cdots)$.

The exponential term in the integral being cancelled, it becomes possible
to carry out the integration by means of the approximate formula
already used in the case of mutation (page 122).

$$2N\,\frac{(2N)!}{(2Nq_1)!\,[2N(1-q_1)]!}\int_0^1 q^{2Nq_1+z-1}(1-q)^{2N(1-q_1)-1}\,dq = \frac{4N + z(z-1)q_1^{-1}}{4N + z(z-1)}\,q_1^{z-1}(1-q_1)^{-1}.$$


The resulting coefficients of the powers of $q_1$ on the right side of the equation
may now be equated separately to those of $\phi(q_1)$. To a sufficient
approximation it turns out that

$$c = \frac{(2Ns)^2}{2!}\,a,\quad d = \frac{(2Ns)^2}{3!}\,b,\quad e = \frac{(2Ns)^4}{4!}\,a,\quad f = \frac{(2Ns)^4}{5!}\,b,\quad g = \frac{(2Ns)^6}{6!}\,a,\ \text{etc.}$$

Letting $C_1 = a/2$ and $C_2 = b/4Ns$,

$$\phi(q) = 2e^{2Nsq}q^{-1}(1-q)^{-1}[C_1\cosh 2Nsq + C_2\sinh 2Nsq].$$

From considerations of symmetry, it is obvious that another solution
may be obtained by replacing q by (1-q) and s by -s. The full solution
may be written in the form

$$\phi(q) = q^{-1}(1-q)^{-1}\left[C_1(e^{4Nsq}+1) + C_2(e^{4Nsq}-1) + C_3(1+e^{-4Ns(1-q)}) + C_4(1-e^{-4Ns(1-q)})\right].$$


The relative values of the coefficients in the case of equilibrium can be
obtained by setting up the equation for the absence of flux. Each group
of genes, $f = \phi(q)\,dq$, tends to be shifted by the amount $\Delta q = sq(1-q)$ in a
generation. There is thus a total flux measured by $\int\phi(q)\Delta q\,dq$ unless
there is counterbalancing mutation. The amount of mutation in each
direction (assuming the rates of recurrence to be very small compared
with 1/4N) is approximately half the respective subterminal classes, as
demonstrated in the preceding cases.

$$f_1 = 2C_1 + 2sC_2 + (1+e^{-4Ns})C_3 + (1-e^{-4Ns})C_4,$$
$$f_{2N-1} = (e^{4Ns}+1)C_1 + (e^{4Ns}-1)C_2 + 2C_3 + 2sC_4.$$


Since mutation moves genes from the fixed classes to the subterminal
classes with gene frequencies of 1/2N and $\left(1-\frac{1}{2N}\right)$ respectively, it
creates a net flux of $\frac{f_{2N-1}}{4N} - \frac{f_1}{4N}$ which at equilibrium should balance that
due to selection

$$\int_0^1\phi(q)\Delta q\,dq = \frac{f_{2N-1}}{4N} - \frac{f_1}{4N}.$$

Substitution of the values given above leads to the condition
$C_1 - C_2 + C_3 + C_4 = 0$. Under this condition the formula simplifies greatly,
becoming for all values of s (of lower order than $1/\sqrt{N}$)

$$\phi(q) = Ce^{4Nsq}q^{-1}(1-q)^{-1}.$$
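
The collapse of the four-term solution to a single exponential under the condition C1 - C2 + C3 + C4 = 0 can be checked numerically. This is only a sketch: the coefficient values are arbitrary illustrative choices, and the combined constant C = C1 + C2 + (C3 - C4)e^{-4Ns} is obtained here by collecting terms, not quoted from the text.

```python
import math

def full_solution_bracket(q, n, s, c1, c2, c3, c4):
    """Bracketed factor of the full solution, without the q^-1 (1-q)^-1 term."""
    x = 4 * n * s
    return (c1 * (math.exp(x * q) + 1) + c2 * (math.exp(x * q) - 1)
            + c3 * (1 + math.exp(-x * (1 - q))) + c4 * (1 - math.exp(-x * (1 - q))))

n, s = 100, 0.01
c1, c2, c3 = 0.7, 1.1, 0.3             # arbitrary illustrative coefficients
c4 = -(c1 - c2 + c3)                   # enforce C1 - C2 + C3 + C4 = 0
c = c1 + c2 + (c3 - c4) * math.exp(-4 * n * s)

# Largest difference between the bracket and C e^{4Nsq} over a few frequencies.
max_gap = max(abs(full_solution_bracket(q, n, s, c1, c2, c3, c4)
                  - c * math.exp(4 * n * s * q))
              for q in (0.05, 0.25, 0.5, 0.75, 0.95))
```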


The effect of selection in this case is perhaps best exhibited in the ratio
of the classes of alternative fixed genes in the highly artificial case of
equality in the rates of mutation in opposite directions. This ratio is $e^{4Ns}$.

More generally, $f_0 = \frac{C}{4Nv}$ and $f_{2N} = \frac{Ce^{4Ns}}{4Nu}$ where u and v, both assumed to
be very small compared with 1/4N, are the opposing mutation rates. The
number of unfixed loci (pairs of allelomorphs) takes the form

$$L = \frac{2N\mu}{f_1 + f_{2N-1}} = \frac{2N\mu}{C(e^{4Ns}+1)}$$

where $\mu$ is the mutation rate per individual and C is chosen so that
$\int\phi(q)\,dq = 1$. The effect of selection on the variance of cumulative characters
(pairs of allelomorphs) may be seen by comparing the formula

$$\sigma^2 = \frac{\mu\alpha^2(e^{4Ns}-1)}{s(e^{4Ns}+1)}$$

with the previously given form $2N\mu\alpha^2$ which it approaches as s approaches 0.

In the case treated by FISHER, there is assumed to be irreversible mutation
at the rate of one per generation from an inexhaustible supply. As
each new gene becomes fixed, it may be considered as transferred to the
type class, ready to mutate to new allelomorphs in its series. Thus in place
of a return flux of $\frac{f_1}{4N}$, due to reversible mutation, we must write $\frac{f_1}{2}$ (if
v = 0):

$$\int_0^1\phi(q)\Delta q\,dq = \frac{f_{2N-1}}{4N} - \frac{f_1}{2}.$$

This is solved if $C_1 = C_3 = C_4 = 0$ and

$$\phi(q) = C_2(e^{4Nsq}-1)q^{-1}(1-q)^{-1}.$$

In case the direction of mutation coincides with that of selection (u=0),
the mutational terms must be written $\frac{f_{2N-1}}{2} - \frac{f_1}{4N}$, giving the solution

$$\phi(q) = C(1-e^{-4Ns(1-q)})q^{-1}(1-q)^{-1}.$$

These are identical with FISHER's results on proper choice of the coefficient.

An interesting question which FISHER has discussed, is the chance of
fixation of a single mutation. This is given by the ratio of the subterminal
classes in the formulae just considered. Where selection opposes
mutation, $\frac{f_1}{f_{2N-1}} = \frac{2s}{e^{4Ns}-1}$, always less than 1/2N. In the case of favorable
mutations, $\frac{f_{2N-1}}{f_1} = \frac{2s}{1-e^{-4Ns}}$ or approximately 2s. FISHER also
gives an independent derivation of the last figure.
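
The two fixation chances are easy to tabulate. The sketch below assumes s is entered as a positive magnitude in both functions (selection for or against the new mutation); the function names and the population size are illustrative only.

```python
import math

def fixation_chance_favorable(s, n):
    """2s / (1 - e^{-4Ns}) for a mutation favored by selection of intensity s."""
    return 2 * s / -math.expm1(-4 * n * s)

def fixation_chance_unfavorable(s, n):
    """2s / (e^{4Ns} - 1) for a mutation opposed by selection of intensity s."""
    return 2 * s / math.expm1(4 * n * s)

n = 1000                                          # illustrative population size
neutral = 1 / (2 * n)                             # chance for a gene indifferent to selection
strong = fixation_chance_favorable(0.01, n)       # 4Ns = 40: practically 2s
weak = fixation_chance_favorable(1e-7, n)         # 4Ns very small: practically 1/2N
against = fixation_chance_unfavorable(0.001, n)   # below 1/2N
```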


General formula 


It is of especial importance to assemble the effects of all evolutionary
factors into a single formula. Unfortunately, the equation of equilibrium
of class frequencies becomes rather complicated and has not yet been
worked through. Presumably the form is given at least approximately
by a formula of the type $Ce^{asq}q^{4Nv-1}(1-q)^{4Nu-1}$ in the case of reversible
mutation. In order that there may be no flux, $\int\phi(q)\Delta q\,dq = 0$. It is not
necessary to consider the terminal classes in this case. Thus

$$\int_0^1 e^{asq}q^{4Nv-1}(1-q)^{4Nu-1}[-uq + v(1-q) + sq(1-q)]\,dq = 0.$$

Integration of the first term (that in -uq) by parts gives an expression
which is immediately solved by letting a=4N. Thus the selection term
appears to be $e^{4Nsq}$ regardless of the rates of mutation provided there is
reversibility. It is approximately of this value in the case of irreversible
mutation, discussed above, provided that s is considerably larger than
1/4N. The conclusions based on the previously presented value $e^{2Nsq}$ still
hold,¹⁰ except that they should be applied to selection intensities just
half as great.

The position of the mode of the I-shaped distribution curve given when
u and s are greater than 1/4N can be found by equating the differential
coefficient of the logarithm of the formula to zero.

$$4Ns + \frac{4Nv-1}{q} - \frac{4Nu-1}{1-q} = 0.$$


When v is small but u and s are both large, q approaches the value
$1 - \frac{u}{s}$ already given as the equilibrium point. The mean would be somewhat
below this point, as expected from the curvilinear relation of selection
pressure to gene frequency and in contrast with the case of equilibrium
between opposing mutation pressures (but no selection) in which the mean
is always the equilibrium point $\frac{v}{u+v}$ and the mode is more extreme than
this figure, $\frac{4Nv-1}{4N(u+v)-2}$.
Migration pressure introduces no other complications. Combining all
factors:

$$\phi(q) = Ce^{4Nsq}q^{4N(mq_m+v)-1}(1-q)^{4N[m(1-q_m)+u]-1}.$$

10 These conclusions were presented at the meeting of the AMERICAN ASSOCIATION FOR THE
ADVANCEMENT OF SCIENCE for 1929 and were summarized in the abstracts (WRIGHT 1929b).


The selection coefficient refers here to the difference between the selec- 
tion in the group under consideration and that in the species as a whole, 
the effect of the latter being taken account of in the mean gene frequency 
of the species qm. 
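
The position of the mode can be located numerically from the condition 4Ns + (4Nv-1)/q - (4Nu-1)/(1-q) = 0. The sketch below uses simple bisection and assumes 4Nu and 4Nv both exceed 1, so that the slope of the logarithm falls steadily from large positive to large negative values across the interval; the parameter values are illustrative, not taken from the figures.

```python
def log_density_slope(q, n, s, u, v):
    """d/dq of log[C e^{4Nsq} q^{4Nv-1} (1-q)^{4Nu-1}]."""
    return 4 * n * s + (4 * n * v - 1) / q - (4 * n * u - 1) / (1 - q)

def mode(n, s, u, v, iterations=200):
    """Bisection for the zero of the slope; assumes 4Nu > 1 and 4Nv > 1."""
    lo, hi = 0.0, 1.0
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if log_density_slope(mid, n, s, u, v) > 0:
            lo = mid        # still climbing: the mode lies to the right
        else:
            hi = mid
    return (lo + hi) / 2

n, u, v, s = 10_000, 1e-3, 1e-4, 1e-2     # illustrative: u and s large, v small
mode_with_selection = mode(n, s, u, v)    # close to 1 - u/s
mode_without_selection = mode(n, 0.0, u, v)
```

With these values the mode under selection falls near 0.90, close to 1 - u/s, and without selection it agrees with (4Nv-1)/[4N(u+v)-2].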


The distribution curves 


Some of the forms taken by the probability array of gene frequencies, in 
cases involving selection, are illustrated in figures 7 to 14. Figures 7 to 10 
deal with the case in which the rates of mutation are negligibly small 
compared with 1/4N. The curves are thus all variants of the form
$y = Ce^{4Nsq}q^{-1}(1-q)^{-1}$. Figure 7 illustrates the relatively slight effect of
selection below a certain relation to size of population. All conditions are
the same in figure 8 except that the populations are four times as large
as in figure 7. Thus while the absolute intensities of the selection coefficients
are the same, the relations to size of population are altered.¹¹ The
curves bring out the great effect of selection beyond the critical point, 
s= oN (where mutation rates are low). Figures 9 and 10 are intended to 
show the effects of change in size of population where the intensity of 
selection remains constant (low in figure 9, four times as severe in figure 


1 
10). Up to a certain point (x °. figure 0) increase in population 
s 


raises the middle portion of the curve. Above this point (figure 10) in- 
crease in population depresses the middle portion. In the former case, 
the increase in unfixed factors brings increased variability of cumulative 
characters, in the latter there is little change of variability in relation to 
population size, the depression among middle frequencies being balanced
by the accumulation of nearly but not quite fixed factors. All of these figures
(7 to 10) may be taken as representative of conditions in small inbred
populations which have been isolated sufficiently long to reach equilibrium
in relation to mutation. It will be recalled that figures 3 and 4 represent
successive stages preceding the attainment of such equilibrium.

11 The probability that increase in the number of unfixed genes would react on the individual
gene selection coefficients, reducing them, is here ignored.

FIGURES 7, 8, 9, and 10. Distribution of gene frequencies in relation to size of population and
intensity of selection where rates of mutation and migration are small compared with 1/4N.
Formulae all of type $y = Ce^{4Nsq}q^{-1}(1-q)^{-1}$.

Figure 7. Small population, four degrees of selection. Figure 8. Population four times as large
as in figure 7 under the same four (absolute) degrees of selection. Figure 9. Three sizes of population
under given weak selection. Figure 10. Same three sizes of population as in figure 9, under
selection four times as severe.

FIGURES 11 to 14. Distribution of gene frequencies in subgroups of large population
(mean frequency $q_m$=1/2) in relation to size of population and intensity of selection. Formulae
all of the type $y = Ce^{4Nsq}q^{4Nmq_m-1}(1-q)^{4Nm(1-q_m)-1}$. Same comparisons as in figures 7 to 10.

Figure 11. Small subgroups (2Nm=1), four degrees of selection. Figure 12. Subgroups four
times as large as in figure 11, under same four (absolute) degrees of selection. Figure 13. Subgroups
of three sizes under given weak selection. Figure 14. Same three sizes as in figure 13 under
selection four times as severe.

The figures may also be used to illustrate cases of equal mutation to and from a gene (u=v).
$y = Ce^{4Nsq}q^{4Nv-1}(1-q)^{4Nu-1}$.

Figures 11 to 14 present exactly the same series of comparisons as figures
7 to 10, for small populations that are not completely isolated from
the main body of a species.¹² In all cases the gene frequency ($q_m$) of the
whole species is assumed to be 1/2. The relations of migration to size of
population are such that there is very little complete fixation of genes. In
figure 11, m=1/2N and the purely exponential curves show how increasing
intensity of genic selection shifts a uniform distribution in the direction
favored by the selection. The fourfold greater population of figure 12
brings about concentration, in curves approaching the normal in form.
Figure 13 brings out the concentrating effect of increase in population in
the case of weak selection while figure 14 does the same for the case of
selection four times as severe.

12 These figures also represent the distribution of gene frequencies in population in which
mutation and reverse mutation are equally frequent, but this seems to be so exceptional a case
especially under multiple allelomorphism, as to be of little importance.
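
Why the curves of figure 11 are purely exponential can be seen by computing the exponents of the combined formula under those conditions. The sketch below is illustrative only: the numerical values of N and s are assumptions (the figures' actual values are not restated here), and the fourfold comparison assumes m is held fixed while N is quadrupled.

```python
import math

def general_exponents(n, s, m, qm, u=0.0, v=0.0):
    """Return (k, a, b) for y = C e^{kq} q^a (1-q)^b in the combined formula."""
    return (4 * n * s,
            4 * n * (m * qm + v) - 1,
            4 * n * (m * (1 - qm) + u) - 1)

def exponential_curve(q, k):
    """Normalized y = C e^{kq} on (0, 1), with C = k / (e^k - 1); uniform when k = 0."""
    if k == 0:
        return 1.0
    return k * math.exp(k * q) / math.expm1(k)

n = 250                                            # hypothetical subgroup size
m = 1 / (2 * n)                                    # figure 11 condition, 2Nm = 1
k, a, b = general_exponents(n, s=0.002, m=m, qm=0.5)
k4, a4, b4 = general_exponents(4 * n, s=0.002, m=m, qm=0.5)   # fourfold larger subgroup

# Midpoint-rule check that the exponential curve has unit area.
steps = 1000
area = sum(exponential_curve((i + 0.5) / steps, k) for i in range(steps)) / steps
```

With 2Nm = 1 and q_m = 1/2 both powers of q and (1-q) are zero, leaving only the factor e^{4Nsq}; in the fourfold larger subgroup the exponents rise to 3 and the curve becomes humped.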

The important case in which mutation is balanced by selection in a 
moderately large population (both s and u large compared with 1/4N) 
is illustrated in figure 19. The four curves represent four degrees of 
selection, rising by doubling of severity at each step from a case in which 
mutation pressure practically overwhelms the effect of selection to the reverse
situation. The limiting condition in populations so large that 1/4N
is very small compared with both s and u is that of concentration of fac- 
tor frequency almost at a single value (figure 20, page 148). 


Dominance ratio 


The form of the distribution of the frequencies of the dominant genes 
affecting a character is of interest in connection with the dominance ratio. 
Since different genes have different mutation rates and selection coeffi- 
cients, this distribution is a composite of curves of the types discussed. 
In small populations which have reached equilibrium, all of these arrays 
and hence their composite are of the type $Cq^{-1}(1-q)^{-1}$. The dominance
ratio is 1/5 in so far as dominance is complete. FISHER gives the value as
3/13 = 0.23 for the case “when in the absence of selection, sufficient mutation
takes place to counteract the effect of random survival.” The difference
from the value 0.20 given above is due solely to the difference in the
formula for the curve, discussed earlier. 

In the case of the isolation of a small part of a large population, the dom- 
inance ratio takes the value 1/3 in so far as dependent on dominant type 
genes in equilibrium with recessive mutation but not with reverse muta- 
tion (y=C/(1—q)). Where following isolation both fixation and loss are 
substantially irreversible (y=1) the dominance ratio is 1/4 in agreement 
with FISHER’s result. In both of these cases, of course, the dominance ratio
falls to 1/5 when equilibrium is finally attained. 

The foregoing discussion applies practically only to very small com- 
pletely isolated populations. In large populations where the distribution 
of gene frequencies, even in partially isolated subgroups, tends to approach
the normal type, the dominance ratio comes to depend mainly on the mean
gene frequency which depends on the relation of selection to mutation, or 
on selection against both homozygotes in favor of heterozygotes. In the ex- 
treme case in which the gene frequency is reduced to a single value, the dom- 
inance ratio is q/(2—q). Values close to unity should not be uncommon, 
especially where gene frequency is controlled by the balance of selection 
and mutation. Such a dominance ratio has rather surprising effects on the 
correlation between relatives. The correlation between parent and off- 
spring approaches 0 although that between brothers may remain as high 
as 0.25. However, the occurrence of an appreciable number of genes at 
lower frequencies, for example, held in equilibrium by selection favoring 
the heterozygote against both homozygotes would greatly lower the dom- 
inance ratio. 
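
The dominance ratios quoted in this section (1/5, 1/3, 1/4 and q/(2-q)) can be reproduced with exact fractions. The sketch assumes the standard decomposition for complete dominance, which is not spelled out in this section: with q the frequency of the dominant gene, the dominance variance per locus is proportional to q²(1-q)² and the additive variance to 2q(1-q)³. The function names are illustrative.

```python
from fractions import Fraction
from math import factorial

def beta_integral(i, j):
    """Integral over (0, 1) of q^i (1-q)^j for non-negative integers i and j."""
    return Fraction(factorial(i) * factorial(j), factorial(i + j + 1))

def dominance_ratio(i, j):
    """Dominance ratio when gene frequencies are distributed as q^i (1-q)^j.

    Integrates the assumed per-locus variances over the distribution:
    dominance ~ q^2 (1-q)^2, additive ~ 2 q (1-q)^3.
    """
    dominance = beta_integral(2 + i, 2 + j)
    additive = 2 * beta_integral(1 + i, 3 + j)
    return dominance / (dominance + additive)

def dominance_ratio_single_value(q):
    """All genes at one frequency q: ratio q/(2-q)."""
    return q / (2 - q)

equilibrium = dominance_ratio(-1, -1)    # y = C q^-1 (1-q)^-1
after_isolation = dominance_ratio(0, -1) # y = C/(1-q)
irreversible = dominance_ratio(0, 0)     # y = 1
```

Under these assumptions the three distributions give 1/5, 1/3 and 1/4 respectively, and a single gene frequency of 0.9 gives a ratio of about 0.82.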

All of these figures are on the assumption that dominance is complete. 
Dominance, however, is frequently not complete. Among 22 heterozy- 
gotes in the guinea pig which have been studied with some care, at least 
9 or about 40 percent are to some extent intermediate. Most of these 
have to do with color characters. It is not unlikely that incomplete dom- 
inance will be found to be even more frequent on careful study of size char- 
acters. 

FISHER (1922) comes rather definitely to the conclusion that the dominance
ratio is typically in the neighborhood of 1/3. This was based primarily
on a distribution of factor frequencies which he reached for the
case of selection against a recessive,¹³ with which the results of the present
study are not at all in agreement. He also finds, however, that the differ- 
ences between fraternal and parent-offspring correlations in data which 
he analyzes indicate the same figure. The analysis of a large number of 
correlations of these sorts would undoubtedly furnish valuable informa- 
tion with regard to the statistical situation in populations. It is to be noted, 
however, that similarity in the environment of brothers as compared 
with parent and offspring may also contribute to a higher fraternal cor- 
relation and that in any case one cannot reason from the dominance ratio 
deduced from correlations to the distribution of factor frequencies with- 
out making some assumption as to the prevalence of dominance. 

About all that seems justified by the present analysis, is the statement
that for permanently small populations under low selection the value should
be less and probably considerably less than 0.20 but that this figure may
be raised by severe selection (favoring dominants) and especially by increase
in size of population. It may even approach unity in very large
populations under severe selection, if complete dominance is the rule.

13 This was given as $df \propto \frac{d\theta}{\sin\frac{1}{2}\theta\,\cos^3\frac{1}{2}\theta}$ or $\phi(q) = Cq^{-1}(1-q)^{-2}$ on transformation of scale.
FISHER does not discuss dominance ratio in connection with his recent revision of his results in
The genetical theory of natural selection, 1930.


Mean and variability of characters 


In the case of genes which are indifferent to selection (s less than u),
the mean frequency q = v/(u+v) remains unchanged through all transformations
from a U-shaped distribution in small populations to an I-shaped
one in large populations. The variance, due to such genes, is small in
small populations, rises in nearly direct proportion to size of population
up to a certain critical point (about N=1/4u) and then approaches a
limiting value. For the case in which mutations in one direction (u) occur
at a much greater rate than in the other (v), the general formula reduces
to

$$\sigma^2 = \sigma_\infty^2\left(\frac{4Nu}{1+4Nu}\right)$$

in which $\sigma_\infty^2$ ($= 2L\alpha^2v/u$) is the limiting value.
This case is illustrated in figure 15. The dotted lines represent mean gene
frequencies and the line of dashes the variance.

Actual changes in the size of a given population are not of course accompanied
by instant adjustment of the distribution of gene frequencies. A
decrease in size to a point well below the critical value is followed by decrease
in heterozygosis and variance at a rate between 1/4N and 1/2N per
generation depending on the number of allelomorphs. This may be a fairly
rapid process in terms of geologic time but the recovery of heterozygosis
through growth of the population to its original size occurs more slowly,
since this depends on mutation pressure. On the other hand, the intercrossing
of a number of isolated strains, in each of which the reduction of
variance has occurred, is followed by immediate recovery of the original
statistical situation (except with respect to factor combinations in which
there is some delay).

In the opposite case of genes under vigorous selection (s much greater
than u) mean frequency as well as variance is affected by size of population
and by severity of selection. As in the preceding case, variance is
small in small populations, rises in nearly direct proportion to growth
of population¹⁴ until a critical point is approached (here about N = 1/4s)
and then rapidly approaches a limiting value

$$\sigma^2 = \sigma_\infty^2\left(\frac{e^{4Ns}-1}{e^{4Ns}+1}\right)$$

where $\sigma_\infty^2 = \mu\alpha^2/s$. The mean factor frequency in large populations
(q = 1-u/s) is close to 1, nearly complete fixation of the favorable gene.
In small populations, on the other hand, the equilibrium point approaches
that of the opposing mutation pressures (q = v/(u+v)) and
hence practically 0, with complete loss (v=0) as the inevitable ultimate
fate in an extended multiple allelomorphic series. Up to the point
at which mutation pressure seriously disturbs the form of the distribution
curve, the mean gene frequencies are simply the ratios of the chances of
fixation at each extreme, namely, $ve^{4Ns}/(u+ve^{4Ns})$.

14 As before, the probability that the increase in variance, due to growth of population, would
react on the selection coefficients of the individual genes, reducing them, requires some qualification
of this statement in application to actual populations.

FIGURES 15 to 17. The variance ($\sigma^2$) and mean gene frequencies (dotted lines) at equilibrium
under various conditions of mutation, selection and size of population. Figure 15. Effects of
increasing population where selection is negligible relative to mutation. Figure 16. Effects
of increasing population where mutation rates are small compared with 1/4N. Figure 17.
Effects of increasing selection where mutation rates are small compared with 1/4N.

The relations of mean frequency and variance to size of population in 
this case are both shown in figure 16, the former for various relative val- 
ues of u and v. Inspection of figures 9 and 10 may also be of assistance in 
understanding this situation. 
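
The dependence of variance and mean gene frequency on population size described above can be tabulated with a few lines. This is a sketch only: it returns the variance as a fraction of its limiting value, 4Nu/(1+4Nu) where selection is negligible and (e^{4Ns}-1)/(e^{4Ns}+1) where mutation rates are low, together with the mean frequency ve^{4Ns}/(u+ve^{4Ns}); the function names and parameter values are illustrative assumptions.

```python
import math

def variance_fraction_mutation(n, u):
    """Variance relative to its limit when selection is negligible: 4Nu / (1 + 4Nu)."""
    return 4 * n * u / (1 + 4 * n * u)

def variance_fraction_selection(n, s):
    """Variance relative to its limit under selection: (e^{4Ns} - 1) / (e^{4Ns} + 1)."""
    x = 4 * n * s
    return math.expm1(x) / (math.exp(x) + 1)

def mean_frequency(n, s, u, v):
    """Mean gene frequency v e^{4Ns} / (u + v e^{4Ns}), valid while mutation is weak."""
    weight = v * math.exp(4 * n * s)
    return weight / (u + weight)

half_way = variance_fraction_mutation(25_000, 1e-5)    # N = 1/4u
at_critical = variance_fraction_selection(250, 1e-3)   # N = 1/4s
small_pop_mean = mean_frequency(1, 1e-3, 1e-5, 1e-6)   # near v/(u+v)
large_pop_mean = mean_frequency(10_000, 1e-3, 1e-5, 1e-6)
```

At the critical size N = 1/4u the variance has reached half its limit; at N = 1/4s the selection formula gives (e-1)/(e+1), a little under one half.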

As in the other case, actual change in size of population is not accom- 
panied by immediate attainment of the new equilibrium. Decrease in popu- 
lation to a number well below the critical point is followed by decrease in 
heterozygosis at the rate described, bringing with it at the same rate 
the well known inbreeding effects, loss of variance and, in general, decline 
in vigor toward a new level. This immediate decline in vigor is not due to 
change in mean gene frequency, but merely to the greater proportion of 
recessive phenotypes as homozygosis increases, and thus comes to an end 
when the degree of homozygosis has reached equilibrium. The change in 
mean gene frequency proceeds more slowly since it depends on mutation 
pressure. Long continued isolation should thus involve two distinct de- 
generation processes, a rapid but soon completed process of fixation and 
a very slow process of accumulation of injurious genes. The recovery 
on increase in size of population is slow in both cases, depending on mu- 
tation pressure. The intercrossing of isolated lines, on the other hand, is 
followed by immediate return to the original status of the population if 
only the immediate inbreeding effect has occurred, but must wait on favor- 
able mutations if there has been time for the slower process. 

The effects of different intensities of selection on mean gene frequency 
and variance (population size constant) are illustrated in figure 17, still 
assuming that the selection coefficient is of higher order than mutation rate. 
Figures 7 and 8 showing the distribution of gene frequencies in this case 
may also be of assistance here. Selection has little effect on variability
until it reaches about the value 1/8N, about half the variance is eliminated
when selection reaches 1/N and most of it at s=4/N. The formula
is $\sigma^2 = \sigma_0^2\left(\frac{e^{4Ns}-1}{2Ns(e^{4Ns}+1)}\right)$. Selection, of course, affects the mean gene
frequency, the formula being the same as that given above under the effect
of size of population. On actual increase in the intensity of selection, the 
rate of change toward the new equilibrium both in mean and variance is 
controlled by selection pressure and may thus be fairly rapid in terms of 
geologic time in a large population. This is the case in which HALDANE’s 
formulae for progress under selection are most applicable. The increase in 
variance and in the proportion of unfavorable genes following relaxa- 
tion of selection, on the other hand, are controlled by mutation pressure 
and thus approach equilibrium relatively slowly. A shift in gene frequency 
at rate uq may well mean no more than 0.000,001 per generation. 
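
The three landmarks quoted for the effect of selection on variability (1/8N, 1/N and 4/N) can be checked against the variance formula for low mutation rates. The sketch assumes that formula in the form μα²(e^{4Ns}-1)/[s(e^{4Ns}+1)], with limit 2Nμα² as s approaches 0, so that the ratio of the two is tanh(2Ns)/(2Ns); the function name and population size are illustrative.

```python
import math

def relative_variance(n, s):
    """Variance under selection s relative to its value at s = 0: tanh(2Ns) / (2Ns)."""
    x = 2 * n * s
    return 1.0 if x == 0 else math.tanh(x) / x

n = 1000                                     # illustrative population size
slight = relative_variance(n, 1 / (8 * n))   # little effect
half = relative_variance(n, 1 / n)           # about half eliminated
most = relative_variance(n, 4 / n)           # most eliminated
```

The ratio depends only on the product Ns, so the same three values are obtained for any population size.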

The type of result where the selection coefficient is of the same order of 
magnitude as mutation rate can be inferred, qualitatively, at least from 
the preceding extreme cases. Inspection of figure 19 may also be of as- 
sistance here. 


THE EVOLUTION OF MENDELIAN SYSTEMS 
Classification of the factors of evolution 


In attempting to draw conclusions with respect to evolution one is apt, 
perhaps, to assume that factors which make for great variation are neces- 
sarily favorable while those which reduce variation are unfavorable. Evo- 
lution, however, is not merely change, it is a process of cumulative change: 
fixation in some respects is as important as variation in others. Live stock 
breeders like to compare their work to that of a modeller in clay. They 
speak of moulding the type toward the ideal which they have in mind. 
The analogy is a good one in suggesting that in both cases it is a certain 
intermediate degree of plasticity that is required. 

The basic cumulative factor in evolution is the extraordinary persis- 
tence of gene specificity. This doubtless rests on a tendency to precise 
duplication of gene structure in the proper environment. The basic change 
factor is gene mutation, the occasional failure of precise duplication. Since 
the time of LAMARCK, a school of biologists have held that the primary 
changes in hereditary constitution must be adaptive in direction in order 
to account for evolutionary advance. Unfortunately, the results of experi- 
mental study have given no support to this view. Instead, the character- 
istics of actually observed gene mutations seem about as unfavorable as 
could be imagined for adaptive evolution. In the first place, is their fortui- 
tous occurrence. No correlation has been found between external condi- 
tions and direction of mutation, and those fewagents which have been found 
to affect the rate (X-ray, radium, and to a relatively unimportant extent, 
temperature) merely speed up the rate of random mutation. The great 
majority of mutations are either definitely injurious to the organism or 
produce such small effects as to be seemingly negligible. MULLER has 
graphically compared the range of mutations to a spectrum in which 
the nonlethal conspicuous mutations form a narrow field between broad 
regions of individually inconspicuous mutations on the one hand and of 
sublethal and lethal mutations on the other. In addition, the great ma- 
jority of mutations are more or less completely recessive to the type genes 
from which they arise. These effects are easily understood if mutation is 
an accidental process. Random changes in a complex organization are 
more likely to injure than to improve it, and with respect to the immediate 
products of the gene, random changes are more likely to be of the nature
of inactivation (and hence probably recessive) than of increased activation.
Finally is to be mentioned the extreme rarity of gene mutation. Even in 
Drosophila, mutation rates as high as $u = 10^{-5}$ per locus seem to be
exceptional and $10^{-6}$ or less more characteristic. This infrequency seems
unfavorable to rapid evolution, yet it is a necessary corollary of the usually
injurious effect, if life is to persist at all. Moreover, the more advanced the 
evolution, the slower must be the time rate of mutation. In one-celled or- 
ganisms, dividing several times a day, a rapid time rate of mutation will 
not prevent the production of sufficient normal offspring to maintain the 
species. The same time rate in Drosophila with an interval of some two 
weeks between generations would mean such an accumulation of lethals 
in every gamete that the species would come to an abrupt end. The time 
rate of lethal mutation in Drosophila (7 per 1000 chromosomes per month 
under ordinary conditions according to MULLER (1928)) would be quite 
impossible in the human species. The problem is to determine how an adap- 
tive evolutionary process may be derived from such unfavorable raw mater- 
ial as the infrequent, fortuitous and usually injurious gene mutations. 
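
The comparison of time rates above can be made concrete with a little arithmetic. The sketch below is only illustrative: it takes the quoted figure of 7 lethals per 1000 chromosomes per month at face value, uses the two-week Drosophila generation mentioned in the text, and assumes a round 25-year human generation (an assumption, not a value from the text).

```python
# Rate quoted in the text (MULLER 1928): 7 lethal mutations per 1000 chromosomes per month.
LETHALS_PER_CHROMOSOME_PER_MONTH = 7 / 1000


def lethals_per_generation(generation_months: float) -> float:
    """Expected new lethals per chromosome per generation at a fixed time rate."""
    return LETHALS_PER_CHROMOSOME_PER_MONTH * generation_months


# Drosophila: about two weeks per generation (from the text).
# Man: 25 years per generation is an ASSUMED round figure for illustration.
drosophila = lethals_per_generation(0.5)
human = lethals_per_generation(25 * 12)

print(f"Drosophila: {drosophila:.4f} new lethals per chromosome per generation")
print(f"Man:        {human:.2f} new lethals per chromosome per generation")
```

Under these assumptions the same time rate that gives a small fraction of a lethal per chromosome per generation in Drosophila would give more than one per chromosome per generation in man, which is the sense in which the rate "would be quite impossible in the human species."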

It will be convenient here to classify factors of evolution according as 
they tend toward genetic homogeneity or heterogeneity of the species. 
They are grouped below in more or less definitely opposing pairs. 


Factors of Genetic Homogeneity               Factors of Genetic Heterogeneity

Gene duplication                             Gene mutation (u, v)
Gene aggregation                             Random division of aggregate
Mitosis                                      Chromosome aberration
Conjugation                                  Reduction (meiosis)
Linkage                                      Crossing over
Restriction of population size (1/2N)        Hybridization (m)
Environmental pressure (s)                   Individual adaptability
Crossbreeding among subgroups ($m_1$)        Subdivision of group ($1/2N_1$)
Individual adaptability                      Local environments of subgroups ($s_1$)

The first pair have been discussed above. They enter into the formulae 
through the mutation rates u and v. MULLER has pointed out the neces- 
sary similarity, in order of size, of genes and of filterable viruses and has 
suggested the possibility that the latter may consist of single genes. If so, 
their evolution rests wholly on a not too high rate of mutation, and selection,
which seems possible enough in organisms as simple as these presum- 
ably are, especially as in this case where the gene is the organism, the mu- 
tation of the gene need not be expected to be as fortuitously related to 
the activities of the organism as in more complex cases. 

Presumably the first step toward higher organisms is the aggregation 
of such genes with multiplication of the aggregate by random division. 
Given occasional gene mutation, this leads to a new kind of variation, that
in proportional abundance of the different kinds of genic material. The 
larger the aggregate, the less violent the variation. Large aggregates
present a labile system capable of quantitative variation in response (perhaps
physiologically as well as through selection) to changing conditions. As far 
as observation goes, the bacteria and blue-green algae have no mechanism of
division beyond a random division of the protoplasmic constituents. Such 
apportionment of more or less autonomous materials may also be
important in the differentiating cell lineages of multicellular organisms, but,
except for a few plastid characters, seems to play no important role in heredity
from generation to generation, as far as has been determined by experi- 
ment. There seems here an adequate basis for an evolutionary process 
in organisms so simple that the handing on of a few different proto- 
plasmic constituents can determine all of the characteristics of the species 
but the conditions are not favorable for an extensive cumulative process. 

Mitosis provides the mechanism by which an indefinitely large number 
of qualitatively different elements may be maintained in the same pro- 
portions. But it provides so perfectly for the persistence of complex 
organization that further change is difficult. Irregularities in mitosis pro- 
vide a source of variation but of so violent a nature for the most part as 
to be of infrequent evolutionary importance, although the differences in 
chromosome numbers of related species demonstrate that they play a 
genuine role. Complete duplication (tetraploidy) is important in doubling
the possible number of different kinds of genes. Other aberrations, es- 
pecially translocations, are probably more important in isolating types, 
than for the character changes which they bring. Gene mutation remains 
the principal factor of variation, but seems inadequate as the basis of an 
evolutionary process under exclusively mitotic (asexual) reproduction. 

The most important factor in transcending the evolutionary difficulties 
inherent in the characteristics of gene mutation is undoubtedly the
attainment of biparental reproduction (EAST 1918). This involves two phases,
conjugation, a factor which makes the entire interbreeding group a phys- 
iological unit in evolution, and meiosis, with its consequence, Mendelian 
recombination which enormously increases the amount of variability with- 
in the limits of the species. Each additional viable mutation in an asexu- 
ally reproducing form merely adds one to the number of types subject 
to natural selection. The chance that two or more indifferent or injurious 
mutations may combine in one line to produce a possibly favorable 
change is of the second or higher order. Under biparental reproduction, 
each new mutation doubles the number of potential variations which may 
be tried out. The contrast is between $n+1$ and $2^n$ types from n viable
mutations. 
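
The contrast between n+1 and 2^n types grows very quickly with n. A minimal sketch of the counting, assuming each of the n viable mutations is at a different locus and that asexual lines carry at most one mutation each (higher-order coincidences being neglected, as in the text); the function names are illustrative only:

```python
def asexual_types(n: int) -> int:
    """Original type plus one new type for each of n viable mutations."""
    return n + 1


def biparental_types(n: int) -> int:
    """Every combination of presence/absence of n mutations is available."""
    return 2 ** n


for n in (1, 5, 10, 20):
    print(f"n = {n:2d}: asexual {asexual_types(n):3d}, biparental {biparental_types(n)}")
```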

Biparental reproduction solves the evolutionary requirement of a rich 
field of variation. But by itself it provides rather too much plasticity. It 
makes a highly adaptable species, capable of producing types fitted to each 
of a variety of conditions, but a successful combination of characteristics 
is attained in individuals only to be broken up in the next generation by 
the mechanism of meiosis itself. 

An excellent illustration of the principle that a balance between factors 
of homogeneity and of heterogeneity may provide a more favorable
condition for evolution than either factor by itself may be found in the effects
of an alternation of a series of asexual generations with an occasional sex- 
ual generation. Evolution is restrained under exclusive asexual reproduc- 
tion by the absence of sufficient variation, and under exclusive sexual 
reproduction by the noncumulative character of the variation, but, on 
alternating with each other, any variety in the wide range of combinations 
provided by a cross may be multiplied indefinitely by asexual reproduc- 
tion. The selection of individuals is replaced by the much more effec- 
tive selection of clones and leads to rapid statistical advance which, 
however, comes to an end with reduction to a single successful clone. On 
the other hand a new cross (before reduction to a single clone) may pro- 
vide a new field of variation making possible a repetition of the process at 
a higher level. This method has been a favorite of the plant breeder and 
is perhaps the most successful yet devised for human control of evolution 
in those cases to which it can be applied at all. Under natural conditions, 
alternation of asexual and sexual reproduction is characteristic of many 
organisms and doubtless has played an important role in their evolution.

The demonstration of the evolutionary advantages of an alternation of
the two modes of reproduction seems to prove too much. Asexual
reproduction is practically absent in the most complex group of animals, the
vertebrates, and is rather sporadic in its occurrence elsewhere. The pur- 
pose of the present paper has been to investigate the statistical situation 
in a population under exclusive sexual reproduction in order to obtain a 
clear idea of the conditions for a degree of plasticity in a species which may 
make the evolutionary process an intelligible one. 

First may be mentioned briefly a modification of the meiotic
mechanism which has been introduced only qualitatively into the investigation
where at all. This is the aggregation of genes into more or less persistent 
systems, the chromosomes. Complete linkage cuts down variability by 
preventing recombination. Wholly random assortment gives maximum 
recombination but does not allow any important degree of persistence of 
combinations once reached. An intermediate condition permits every com- 
bination to be formed sooner or later and gives sufficient persistence of 
such combinations to give a little more scope to selection than in the case 
of random assortment. Close linkage, moreover, brings about a condition 
in which selection tends to favor the heterozygote against both homozy- 
gotes and so helps in maintaining a store of unfixed factors in the popula- 
tion. 

Restriction of size of population, measured by 1/2N, is a factor of
homogeneity and conversely with increase of size. The effects of restricted size
may also be balanced by those of occasional external hybridization, mea- 
sured by m. 

Environmental pressure on the species as a whole is a factor of homo- 
geneity. It has been urged by some that because natural selection is a 
factor which reduces variability, and most conspicuously by eliminating 
extreme types, it cannot be the guiding principle in adaptive evolution. 
From the viewpoint of evolution as a moving equilibrium, however, the 
guiding principle may be found on the conservative as well as on the radi- 
cal side. The selection coefficient, s, depends on the balance between en- 
vironmental pressure and individual adaptability. High development of 
the latter permits the survival of genetically diverse types in the face of 
severe pressure. 

Subdivision of a population into almost completely isolated groups, 
whether by prevailing self fertilization, close inbreeding, assortative mat- 
ing, by habitat or by geographic barriers is a factor of heterogeneity with 
effects measured by $1/2N_1$, $N_1$ being here the size of the subgroup. This
factor may be balanced by crossbreeding between such groups, measured
by $m_1$.

It is interesting to note that restriction of population size is a factor of 
homogeneity or of heterogeneity for the species, depending on whether it 
relates to the species as a whole or to subgroups and conversely with the 
crossbreeding coefficient. Similarly the selection pressures of varied
environments within the range of the species ($s_1$) constitute factors of
heterogeneity, restrained from excessive genetic effect by the same individual
adaptability which appears in the opposite column in relation to the 
general environment of the species. Individual adaptability is, in fact, 
distinctly a factor of evolutionary poise. It is not only of the greatest signi- 
ficance as a factor of evolution in damping the effects of selection and keep- 
ing these down to an order not too great in comparison with 1/4N and u, 
but is itself perhaps the chief object of selection. The evolution of complex 
organisms rests on the attainment of gene combinations which determine 
a varied repertoire of adaptive cell responses in relation to external con- 
ditions. The older writers on evolution were often staggered by the seeming 
necessity of accounting for the evolution of fine details of an adaptive 
nature, for example, the fine structure of all of the bones. From the view 
that structure is never inherited as such, but merely types of adaptive cell 
behavior which lead to particular structures under particular conditions, 
the difficulty to a considerable extent disappears. The present difficulty 
is rather in tracing the inheritance of highly localized structural details to 
a more immediate inheritance of certain types of cell behavior. 


Lability as the condition for evolution 


The statistical effects of the more important of these factors in a freely 
interbreeding population are brought together in the formula 


$$y = C e^{4Nsq} q^{-1} (1-q)^{4Nu-1}.$$


The term 4Nv in the exponent of q is here assumed to be negligible and 
the terms applicable in case of external hybridization are also omitted. 
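
The shape of this distribution under different conditions can be examined numerically. The following is a minimal sketch, assuming the formula is read as y = C e^{4Nsq} q^{-1} (1-q)^{4Nu-1} and evaluated only on the unfixed classes q = 1/2N, 2/2N, ..., 1 - 1/2N, with C chosen so that these classes sum to 1. The function name and the parameter values are illustrative, not taken from the paper.

```python
import math


def gene_frequency_distribution(N, s, u):
    """Relative frequencies of the unfixed classes under
    y = C * exp(4*N*s*q) * q**-1 * (1-q)**(4*N*u - 1)."""
    two_n = 2 * N
    qs = [k / two_n for k in range(1, two_n)]
    # Work with logarithms so that large exponents do not overflow.
    log_y = [
        4 * N * s * q - math.log(q) + (4 * N * u - 1) * math.log(1 - q)
        for q in qs
    ]
    top = max(log_y)
    y = [math.exp(v - top) for v in log_y]
    total = sum(y)
    return qs, [v / total for v in y]


# Small population: 4Ns and 4Nu are both far below 1 (illustrative values).
qs, y = gene_frequency_distribution(N=50, s=1e-4, u=1e-5)
middle = len(y) // 2
print(f"y at q={qs[0]:.2f}: {y[0]:.4f}")
print(f"y at q={qs[middle]:.2f}: {y[middle]:.4f}")
print(f"y at q={qs[-1]:.2f}: {y[-1]:.4f}")
```

With 4Ns and 4Nu both small, the exponential factor is nearly constant and the curve is U-shaped, with the classes next to fixation or loss much more heavily weighted than the middle ones.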

Consider first the situation in a small population in which 1/4N is much 
greater than u and than s (figure 18). Nearly all genes are fixed in one 
phase or another. Even rather severe selection is without effect. There is 
no equilibrium for individual genes. They drift from one state of fixation to 
another in time regardless of selection, but the rate of transfer is extremely 
slow. Such evolution as there is, is random in direction and tends toward 
extinction of the group. 
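
The drift to fixation in a small population can be illustrated by a simple sampling experiment. The sketch below assumes pure chance variation only (no mutation, no selection), with each generation formed by drawing 2N gametes binomially from the preceding one; the population size, number of generations, number of replicate loci and the seed are arbitrary illustrative choices.

```python
import random


def drift(N, q0, generations, rng):
    """Follow one gene frequency under binomial sampling of 2N gametes."""
    q = q0
    for _ in range(generations):
        if q in (0.0, 1.0):
            break  # fixed or lost; nothing further can change without mutation
        copies = sum(rng.random() < q for _ in range(2 * N))
        q = copies / (2 * N)
    return q


rng = random.Random(1931)  # fixed seed so the run is repeatable
finals = [drift(N=10, q0=0.5, generations=500, rng=rng) for _ in range(200)]
fixed = sum(q == 1.0 for q in finals)
lost = sum(q == 0.0 for q in finals)
print(f"fixed: {fixed}, lost: {lost}, still segregating: {200 - fixed - lost}")
```

Starting from q = 0.5 with N = 10, essentially every locus ends in one state of fixation or the other well within 500 generations, the direction being a matter of chance.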

Consider next the opposite extreme, a very large undivided population
under severe selection. Assume that s is in general much greater than u
and that the latter is much greater than 1/4N. There is almost complete
fixation of the favored gene for each locus. Here also there is little
possibility of evolution. There would be complete equilibrium under uniform
conditions if the number of allelomorphs at each locus were limited. With
an unlimited chain of possible gene transformations, new favorable
mutations should arise from time to time and gradually displace the hitherto
more favored genes but with the most extreme slowness even in terms of
geologic time.

FIGURES 18 to 21. Distributions of gene frequencies in relation to size of population, selection,
mutation and state of subdivision. Figure 18. Small population, random fixation or loss of genes
($y = Cq^{-1}(1-q)^{-1}$). Figure 19. Intermediate size of population, random variation of gene
frequencies about modal values due to opposing mutation and selection
($y = Ce^{4Nsq}q^{-1}(1-q)^{4Nu-1}$). Figure 20. Large population, gene frequencies in equilibrium
between mutation and selection ($q = 1 - u/s$, etc.). Figure 21. Subdivisions of large population,
random variation of gene frequencies about modal values due to immigration and selection
($y = Ce^{4Nsq}q^{4Nmq_m-1}(1-q)^{4Nm(1-q_m)-1}$).

Even if selection is relaxed to such a point that the selection coefficients 
of many of the genes are not much greater than mutation rates, the con- 
ditions are not favorable for a rapid evolution (figure 20). The amount 
of variability in the population may be great, maximum in fact, but if the 
distributions of gene frequencies are closely concentrated about single 
values, the situation approaches one of complete equilibrium and hence 
of complete cessation of evolution. At best an extremely slow, adaptive, 
and hence probably orthogenetic advance is to be expected from new mu- 
tations and from the effects of shifting conditions. 
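
The equilibrium q = 1 - u/s quoted for figure 20 can be checked by direct iteration. This is a minimal sketch, assuming a population large enough for chance to be neglected and taking the change per generation as sq(1-q) - uq (the expression of the summary with v and m neglected); the function name, starting frequency and the particular values of s and u are illustrative assumptions.

```python
def equilibrium_frequency(s, u, q0=0.5, generations=2_000_000):
    """Iterate dq = s*q*(1-q) - u*q for a fixed number of generations."""
    q = q0
    for _ in range(generations):
        q += s * q * (1 - q) - u * q
    return q


u = 1e-5
for label, s in (("severe selection ", 1e-3), ("relaxed selection", 5e-5)):
    q_hat = equilibrium_frequency(s, u)
    print(f"{label}: iterated q = {q_hat:.6f}, 1 - u/s = {1 - u / s:.6f}")
```

The iteration needs a very large number of generations to settle when s is close to u, which is in keeping with the slowness of change emphasized in the text.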

It should be added that a relatively rapid shift of gene frequencies can 
be brought about in this case by vigorous increase in the intensity of 
selection. The effects of unopposed selection of various sorts and in
various relations of the genes have been studied exhaustively by HALDANE, with
regard to the time required to bring about a shift of gene frequency of any 
required amount. The end result, however, is the situation previously 
discussed. The rapid advance has been at the expense of the store of varia- 
bility of the species and ultimately puts the latter in a condition in which 
any further change must be exceedingly slow. Moreover, the advance is 
of an essentially reversible type. There has been a parallel movement of 
all of the equilibria affected and on cessation of the drastic selection, mu- 
tation pressure should (with extreme slowness) carry all equilibria back 
to their original positions. Practically, complete reversibility is not to be 
expected, and especially under changes in selection which are more com- 
plicated than can be described as alternately severe and relaxed. Never- 
theless, the situation is distinctly unfavorable for a continuing evolutionary 
process. 

8 This, nevertheless, seems to be the case which FISHER (1930) considers most favorable to
evolution. The greatest difference between our conclusions seems to lie here. His theory is one of
complete and direct control by natural selection while I attribute greatest immediate importance
to the effects of incomplete isolation.

Thus conditions are unfavorable for evolution both in very small and
in very large, freely interbreeding, populations, and largely irrespective
of severity of selection. We have next to consider the intermediate situation
in which s is not much greater than u for many genes and the latter
is not much greater than 1/4N. Such a case is illustrated in figure 19. 
The size of population is sufficient to prevent random fixation of genes, 
but insufficient to prevent random drifting of gene frequencies about their 
mean values, as determined by selection and mutation. It is to be sup- 
posed that the relations of the selection and mutation coefficients vary 
from factor to factor. The more indifferent ones drift about through a
wide range of frequencies in the course of geologic time while those under
more severe selection oscillate about positions close to complete fixation.
In any case, all gene frequencies are continually changing even under uni- 
form environmental conditions. But the selection coefficients themselves 
are in general to be considered functions of the entire array of gene 
frequencies and will therefore also be continually changing. The prob- 
ability arrays of some genes will travel to the right and close up as their 
selection coefficients stiffen, while some of the genes which have been 
nearly fixed will come to be less severely selected and their probability 
arrays will shift to the left and open out or even move to the extreme left 
under displacement by another allelomorph. A continuous and essentially 
irreversible evolutionary process thus seems inevitable even under com- 
pletely uniform conditions. The direction is largely random over short 
periods but adaptive in the long run. The less the variation of gene fre- 
quency about its mean value, the closer the approach to an adaptive ortho- 
genesis. Complete separation of the species into large subspecies should 
be followed by rather slow more or less closely parallel evolutions, if the 
conditions are similar, or by adaptive radiation, under diverse conditions, 
while isolation of smaller groups would be followed by a relatively rapid 
but more largely nonadaptive radiation. 

As to rate, since the process depends mainly on the value of 1/4N, as- 
sumed to be somewhat less than u (and s) the process cannot be as rapid as 
one due temporarily to either unopposed selection or unopposed mutation 
pressure. Hundreds of thousands of generations seem to be required at best 
for important nonadaptive evolutionary changes of the species as a whole; 
while adaptive advance, depending on the chance attainment of favorable 
combinations would be much slower. Even so the process is much the 
most rapid non-self-terminating one yet considered. 

In reaching the tentative conclusion that the situation is most favorable 
for evolution in a population of a certain intermediate size, one important 
consideration has been omitted. This is the tendency toward subdivision 
into more or less completely isolated subgroups in widely distributed 
populations. Within each subgroup there is a distribution of gene
frequencies dependent largely on its own size ($N_1$), amount of crossing with
the rest of the species ($m_1$), selection due to local conditions ($s_1$) and the
mean gene frequency of migrants from the rest of the species ($q_m$). It
may be assumed that $1/4N_1$ is so much larger than u that mutation pressure
(and also the average selection coefficient, s, for the whole species) can be
ignored.


$$y = C e^{4N_1 s_1 q} q^{4N_1 m_1 q_m - 1} (1-q)^{4N_1 m_1 (1-q_m) - 1}$$


Gene frequency in each subgroup oscillates about a mean value, which 
is that of the whole species only if conditions of selection are uniform. 
Figure 21 represents various cases. The random variations of gene fre- 
quency have effects similar to those described above within each group. 
The result is a partly nonadaptive, partly adaptive radiation among the 
subgroups. Those in which the most successful types are reached presum- 
ably flourish and tend to overflow their boundaries while others decline, 
leading to changes in the mean gene frequency of the species as a whole. In 
this case, the rate of evolution should be much greater than in the previous 
cases. The coefficients $1/4N_1$ and $s_1$ may be relatively large and bring
about rapid differentiation of subgroups, while the competition between 
subgroups will bring about rapid changes in the gene frequencies of the 
species as a whole. The direction of evolution of the species as a whole will 
be closely responsive to the prevailing conditions, orthogenetic as long as 
these are constant, but changing with sufficiently long continued environ- 
mental change. 
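
The statement that the subgroup mean coincides with that of the species only under uniform selection can be checked numerically. The sketch below is illustrative only: it reads the subgroup distribution as y = C e^{4 N_1 s_1 q} q^{4 N_1 m_1 q_m - 1} (1-q)^{4 N_1 m_1 (1-q_m) - 1}, integrates it by the midpoint rule, and assumes parameter values for which both exponents are positive so that the curve is well behaved at the ends. The function name and all numerical values are assumptions.

```python
import math


def subgroup_mean(N1, m1, s1, qm, steps=2000):
    """Mean gene frequency under the subgroup distribution (midpoint rule)."""
    a = 4 * N1 * m1 * qm - 1          # exponent of q
    b = 4 * N1 * m1 * (1 - qm) - 1    # exponent of (1 - q)
    qs = [(k + 0.5) / steps for k in range(steps)]
    # Logarithms keep the large exponents from overflowing.
    log_y = [4 * N1 * s1 * q + a * math.log(q) + b * math.log(1 - q) for q in qs]
    top = max(log_y)
    weights = [math.exp(v - top) for v in log_y]
    return sum(q * w for q, w in zip(qs, weights)) / sum(weights)


# Illustrative subgroup: N1 = 1000, m1 = 0.01, migrants with q_m = 0.5.
for s1 in (-0.01, 0.0, 0.01):
    mean_q = subgroup_mean(N1=1000, m1=0.01, s1=s1, qm=0.5)
    print(f"s1 = {s1:+.2f}: mean q = {mean_q:.3f}")
```

With no local selection the mean equals q_m; a local selection coefficient of either sign displaces it, which is the source of the differentiation among subgroups described above.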

A question which requires consideration is the effect of alternation of 
conditions, large and small size of population, severe and low selection. 
The effects of changes in the conditions of selection have already been 
touched upon. Persistence of small numbers or of severe selection for such 
periods of time as to bring about extensive fixation of factors compromises 
evolution for a long time following, there being no escape from fixation 
except by mutation pressure. Many thousands of generations may be 
required after restoration to large size and not too severe selection, before 
evolutionary plasticity is restored. Short time oscillations in population 
number or severity of selection, on the other hand, probably tend to speed 
up evolutionary change by causing minor changes in gene frequency. 


Control of evolution 


With regard to control of the process, it is evident that little is possible 
either within a small stock or a freely interbreeding large one. Even drastic 
selection is of little effect in the former, and in the latter, while it may bring 
about a rapid immediate change in the particular respect selected, this 
must be at the expense of other characters, and in any case, soon leads to a 
condition in which further advance must wait on the occurrence of muta- 
tions more favorable than those fixed by the selection. The limitations in 
this case have been well brought out in a recent discussion by KEMP (1929).
Maximum continuous progress in a homogeneous population requires an 
intensity of selection for each of the more indifferent genes not much 
greater than its mutation rate and also a certain size of population. Even 
so, the direction of advance is somewhat uncertain and the rate to be 
measured in geologic time. 

If infrequency of mutation is the limiting factor here, it would seem that 
a considerable increase in the rate of evolution should be made possible 
by a speeding up of mutation, as by X-rays. There is a limit, however, 
imposed by the prevailingly injurious character of mutations. Even the 
most rigorous culling of individuals means in general, only a low selection 
coefficient (in absolute terms) for each of the presumably numerous unfixed 
genes, which are not in themselves lethal or sublethal in effect. Such cull- 
ing would become insufficient to hold mutation pressure in check when the 
latter had increased beyond a certain point (u>s). Moreover, as the 
number of unfixed genes becomes greater under an increased mutation 
rate, the smaller become the separate gene selection coefficients, making
it certain that mutation rate could not increase very much before the 
possibility of effective selection (in all respects at once) rather than infre- 
quency of mutation would become the limiting factor. With respect to 
lethal mutations, it has already been noted that the observed natural time 
rate in Drosophila is such as would mean immediate extinction, if trans- 
ferred to the human species. It is clear that an evolution in the direction 
of increased gene stability, rather than mutability, has been a necessary 
phase, in the evolution of the longer lived higher animals. This makes it 
unlikely that a general increase in mutation rate would increase the rate of 
evolutionary advance along adaptive lines. 

The only practicable method of bringing about a rapid and non-self- 
terminating advance seems to be through subdivision of the population 
into isolated and hence differentiating small groups, among which selection 
may be practiced, but not to the extent of reduction to only one or two 
types (WRIGHT 1922a). The crossing of the superior types followed by 
another period of isolation, then by further crossing and so on ad infinitum 
presents a system by means of which an evolutionary advance through the 
field of possible combinations of the genes present in the original stock, and 
arising by occasional mutation, should be relatively rapid and practically 
unlimited. The occasional use of means for increasing mutation rate within 
limited portions of the population should add further to the possibilities of 
this system. 


Agreement with data of evolution 


We come finally to the question as to how far the characteristics of 
evolution in nature can be accounted for on a Mendelian basis. A review 
of the data of evolution would go far beyond the scope of the present paper. 
It may be suggested, however, that the type of moving equilibrium to be 
expected, according to the present analysis, in a population comparable to 
natural species in numbers, state of subdivision, conditions of selection, 
individual adaptability, etc. agrees well with the apparent course of evolu- 
tion in the majority of cases, even though heredity depend wholly on 
genes with properties like those observed in the laboratory. Adaptive 
orthogenetic advances for moderate periods of geologic time, a winding 
course in the long run, nonadaptive branching following isolation as the 
usual mode of origin of subspecies, species and perhaps even genera, adap- 
tive branching giving rise occasionally to species which may originate new 
families, orders, etc.; apparent continuity as the rule, discontinuity the 
rare exception, are all in harmony with this interpretation. 

The most serious difficulties are perhaps in apparent cases of nonadaptive 
orthogenesis on the one hand and extreme perfection of complicated adap- 
tations on the other. In so far as extreme degeneration of organs is con- 
cerned, there is little difficulty—this is to be expected as a by-product of 
other evolutionary changes. Because of their multiple effects, there can 
be no really indifferent genes, whatever may be true of organs which 
have been reduced beyond a certain size. Zero as the value of a selection 
coefficient is merely a mathematical point between positive and negative 
values. It is common observation that mutations are more likely to reduce 
the development of an organ than to stimulate it. It follows that
evolutionary change in general will have as a by-product the gradual elimination
of indifferent organs. Nonadaptive orthogenesis of a positive sort, increase 
of size of organs to a point which threatens the species, constitutes a more 
difficult problem, if a real phenomenon. Probably many of the cases cited 
are cases in which the line of evolution represents the most favorable 
immediately open to a species doomed by competition with a form
of radically different type or else cases in which selection based on
individual advantage leads the species into a cul-de-sac. The nonadaptive
differentiation of small subgroups and the great effectiveness of subsequent 
selection between such groups as compared with that between individuals 
seem important factors in the origin of peculiar adaptations and the attain- 
ment of extreme perfection. It is recognized that there are specific cases 
which seem to offer great difficulty. This should not obscure the fact that 
the bulk of the data indicate a process of just the sort which must be occur- 
ring in any case to some extent as a statistical consequence of the known 
mechanism of heredity. The conclusion seems warranted that the enor- 
mous recent additions to knowledge of heredity have merely strengthened 
the general conception of the evolutionary process reached by DARWIN 
in his exhaustive analysis of the data available 70 years ago. 


“Creative” and “emergent” evolution


The present discussion has dealt with the problem of evolution as one 
depending wholly on mechanism and chance. In recent years, there has 
been some tendency to revert to more or less mystical conceptions
revolving about such phrases as “emergent evolution” and “creative evolution.”
The writer must confess to a certain sympathy with such viewpoints philo- 
sophically but feels that they can have no place in an attempt at scientific 
analysis of the problem. One may recognize that the only reality directly 
experienced is that of mind, including choice, that mechanism is merely a 
term for regular behavior, and that there can be no ultimate explanation 
in terms of mechanism—merely an analytic description. Such a descrip- 
tion, however, is the essential task of science and because of these very 
considerations, objective and subjective terms cannot be used in the same 
description without danger of something like 100 percent duplication. 
Whatever incompleteness is involved in scientific analysis applies to the 
simplest problems of mechanics as well as to evolution. It is present in 
most aggravated form, perhaps, in the development and behavior of 
individual organisms, but even here there seems to be no necessary limit 
(short of quantum phenomena) to the extent to which mechanistic analy- 
sis may be carried. An organism appears to be a system, linked up in such 
a way, through chains of trigger mechanisms, that a high degree of 
freedom of behavior as a whole merely requires departures from regularity 
of behavior among the ultimate parts, of the order of infinitesimals raised 
to powers as high as the lengths of the above chains. This view implies 
considerable limitations in the synthetic phases of science, but in any case 
it seems to have reached the point of demonstration in the field of quantum 
physics that prediction can be expressed only in terms of probabilities, 
decreasing with the period of time. As to evolution, its entities, species and 
ecologic systems, are much less closely knit than individual organisms. 
One may conceive of the process as involving freedom, most readily
traceable in the factor called here individual adaptability. This, however, is a
subjective interpretation and can have no place in the objective scientific 
analysis of the problem. 


SUMMARY 


The frequency of a given gene in a population may be modified by a
number of conditions including recurrent mutation to and from it,
migration, selection of various sorts and, far from least in importance,
mere chance variation. Using q for gene frequency, v and u for mutation
rates to and from the gene respectively, m for the exchange of population
with neighboring groups with gene frequency $q_m$, s for the selective
advantage of the gene over its combined allelomorphs and N for the
effective number in the breeding stock (much smaller as a rule than the
actual number of adult individuals) the most probable change in gene
frequency per generation may be written:


$$\Delta q = v(1-q) - uq - m(q-q_m) + sq(1-q)$$


and the array of probabilities for the next generation as
$[(1-q-\Delta q)a + (q+\Delta q)A]^{2N}$. The contribution of zygotic
selection (reproductive rates of aa, Aa and AA as $1-s' : 1-hs' : 1$) is
$\Delta q = s'q(1-q)[1-q+h(2q-1)]$.
In interpreting results it is necessary to recognize that the above coeffi- 
cients are continually changing in value and especially that the selection 
coefficient of a particular gene is really a function not only of the relative 
frequencies and momentary selection coefficients of its different allelo- 
morphs but also of the entire system of frequencies and selection coeffi- 
cients of non-allelomorphs. Selection relates to the organism as a whole 
and its environment and not to genes as such. The mutation rate to a gene 
(v) can usually be treated as of negligible magnitude assuming the preva- 
lence of multiple allelomorphs. 
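
The two expressions for the most probable change can be evaluated directly. The following Python sketch is offered only as an illustration: the function names and the numerical values are assumptions introduced here and do not come from the paper.

```python
def delta_q(q, v=0.0, u=0.0, m=0.0, q_m=0.0, s=0.0):
    """Most probable change in gene frequency per generation (genic selection).

    q    : current gene frequency
    v, u : mutation rates to and from the gene
    m    : exchange of population with neighbours of gene frequency q_m
    s    : selective advantage of the gene over its allelomorphs
    """
    return v * (1 - q) - u * q - m * (q - q_m) + s * q * (1 - q)


def delta_q_zygotic(q, s1, h):
    """Contribution of zygotic selection, with aa, Aa and AA reproducing
    as 1 - s1 : 1 - h*s1 : 1 (s1 stands for s' in the text)."""
    return s1 * q * (1 - q) * (1 - q + h * (2 * q - 1))


if __name__ == "__main__":
    # Hypothetical values, chosen only for illustration.
    # Opposing mutation pressures balance at q = v / (u + v) = 0.25.
    print(delta_q(0.25, v=1e-5, u=3e-5))
    # With h = 0.5 (no dominance) the zygotic term reduces to
    # genic selection with s = s1 / 2.
    print(delta_q_zygotic(0.5, s1=0.1, h=0.5))
```

The zygotic term is kept as a separate function because the summary states it as a contribution to the change in q, to be used in place of the genic term sq(1 - q).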

In a population so large that chance variation is negligible, gene
frequency reaches equilibrium when $\Delta q = 0$. Among special cases is
that of opposing mutation rates ($q = v/(u+v)$), of selection against both
homozygotes ($q = (1-h)/(1-2h)$), of mutation against genic selection
($q = 1 - u/s$), of mutation against zygotic selection ($q = 1 - u/(hs)$
unless h approaches 0, when $q = 1 - \sqrt{u/s}$), of selection and
migration ($q = 1 - \frac{m}{s}(1-q_m)$ or $-\frac{m}{s}q_m$ if s is much
greater than m, $q = q_m\left(1 + \frac{s}{m}(1-q_m)\right)$ if s is much
smaller than m, while the values $q = \sqrt{q_m}$ or $1 - \sqrt{1-q_m}$ when
$s = \pm m$ illustrate the intermediate case).
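
The special cases listed above can be checked numerically by solving for the frequency at which the change in q vanishes. The sketch below is an assumption-laden illustration, not the author's procedure: it uses the genic form of the change in q, a plain bisection search, and hypothetical parameter values.

```python
import math


def delta_q(q, v=0.0, u=0.0, m=0.0, q_m=0.0, s=0.0):
    """Genic form of the most probable change in gene frequency."""
    return v * (1 - q) - u * q - m * (q - q_m) + s * q * (1 - q)


def equilibrium(lo=0.0, hi=1.0, tol=1e-12, **rates):
    """Bisection for a root of delta_q in [lo, hi].

    Assumes delta_q changes sign between lo and hi.
    """
    f_lo = delta_q(lo, **rates)
    f_hi = delta_q(hi, **rates)
    if f_lo == 0:
        return lo
    if f_hi == 0:
        return hi
    if f_lo * f_hi > 0:
        raise ValueError("delta_q does not change sign on the interval")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = delta_q(mid, **rates)
        if f_mid == 0:
            return mid
        if f_lo * f_mid < 0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    # Hypothetical rates, chosen only to exercise each special case.
    print(equilibrium(v=1e-5, u=3e-5), 1e-5 / (3e-5 + 1e-5))    # opposing mutation
    print(equilibrium(lo=0.5, u=1e-5, s=0.01), 1 - 1e-5 / 0.01)  # mutation against selection
    print(equilibrium(m=0.01, s=0.01, q_m=0.25), math.sqrt(0.25))           # s = +m
    print(equilibrium(m=0.01, s=-0.01, q_m=0.75), 1 - math.sqrt(1 - 0.75))  # s = -m
    print(equilibrium(m=0.001, s=0.1, q_m=0.2), 1 - (0.001 / 0.1) * (1 - 0.2))  # s >> m
```

In the mutation against selection case the search starts at 0.5 because q = 0 is also a root of the expression when v is taken as negligible.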

Gene frequency fluctuates about the equilibrium point in a distribution 
curve, the form of which depends on the relations between population 
number and the various pressures. The general formula in the case of a 
freely interbreeding group, assuming genic selection, is 


$$y = C e^{4Nsq}\, q^{4N(mq_m+v)-1}\,(1-q)^{4N[m(1-q_m)+u]-1}$$
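
As a rough numerical illustration of this distribution, the following Python sketch evaluates the curve without its constant C and finds the mean gene frequency by a midpoint sum. The function names, the step count and the parameter values are assumptions made for this example only.

```python
import math


def wright_density_unnormalized(q, N, s=0.0, m=0.0, q_m=0.0, u=0.0, v=0.0):
    """Distribution of gene frequency, omitting the constant C."""
    a = 4 * N * (m * q_m + v) - 1        # exponent of q
    b = 4 * N * (m * (1 - q_m) + u) - 1  # exponent of (1 - q)
    return math.exp(4 * N * s * q) * q ** a * (1 - q) ** b


def mean_gene_frequency(N, steps=20000, **pressures):
    """Mean of q under the distribution, by the midpoint rule.

    Adequate when both exponents are positive; a U-shaped curve
    (negative exponents) would need a more careful integration.
    """
    h = 1.0 / steps
    total = 0.0
    moment = 0.0
    for i in range(steps):
        q = (i + 0.5) * h
        y = wright_density_unnormalized(q, N, **pressures)
        total += y
        moment += q * y
    return moment / total


if __name__ == "__main__":
    # Hypothetical values: N = 1000, v = 0.001, u = 0.003, no migration.
    print(mean_gene_frequency(1000, v=1e-3, u=3e-3))            # close to v / (u + v)
    print(mean_gene_frequency(1000, v=1e-3, u=3e-3, s=1e-3))    # shifted upward by selection
```

Without selection the mean agrees with v/(u + v), the value the summary gives for genes indifferent to selection.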


The correlation between relatives is affected by the form of the
distribution of gene frequencies through FISHER’s “dominance ratio.” It appears
that this is less than 0.20 in small populations under low selection but may 
even approach 1 in large populations under severe selection against re- 
cessives. 

In a large population in which gene frequencies are always close to their 
equilibrium points, any change in conditions other than population num- 
ber is followed by an approach toward the new equilibria at rates given 
by the Aq’s. Great reduction in population number is followed by fixation 
and loss of genes, each at the rate 1/4N per generation, where N refers to 
the new population number. This applies either in a group of monoecious 
individuals with random fertilization or, approximately, in one equally 
divided between males and females (9.6 percent instead of 12.5 percent, 
however, under brother-sister mating, N=2). More generally with an
effective breeding stock of $N_m$ males and $N_f$ females, the rates of fixation
and of loss are each approximately $1/(16N_m) + 1/(16N_f)$ until mutation
pressure at length brings equilibrium in a distribution approaching first
the form $y = C(1-q)^{-1}$ with decay at rate u, and ultimately $Cq^{-1}(1-q)^{-1}$.
The converse process, great increase in the size of a long inbred population,
is followed by a slow approach toward the new equilibrium at a rate de- 
pendent in the early stages on mutation pressure. 
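
The rates of fixation and loss quoted above can be illustrated with a small exact calculation. The sketch below assumes the simple binomial sampling of 2N genes per generation, as in the array of probabilities given earlier, starts from a gene frequency of one half, and uses hypothetical function names. It follows the proportion of populations that are still unfixed from generation to generation.

```python
from math import comb


def fixation_rate_equal_sexes(N):
    """Rate of fixation (and of loss) per generation, 1/4N."""
    return 1.0 / (4 * N)


def fixation_rate_two_sexes(N_m, N_f):
    """Approximate rate with N_m males and N_f females."""
    return 1.0 / (16 * N_m) + 1.0 / (16 * N_f)


def transition_matrix(N):
    """Binomial sampling of 2N genes: row i gives the chances of j copies
    next generation when i copies are present now."""
    M = 2 * N
    rows = []
    for i in range(M + 1):
        p = i / M
        rows.append([comb(M, j) * p ** j * (1 - p) ** (M - j)
                     for j in range(M + 1)])
    return rows


def unfixed_decay_ratio(N, generations=200):
    """Ratio of the unfixed proportion in successive generations,
    after the distribution has settled to its steady form."""
    M = 2 * N
    P = transition_matrix(N)
    dist = [0.0] * (M + 1)
    dist[N] = 1.0  # start with gene frequency one half
    previous = 1.0
    ratio = None
    for _ in range(generations):
        dist = [sum(dist[i] * P[i][j] for i in range(M + 1))
                for j in range(M + 1)]
        unfixed = sum(dist[1:M])
        ratio = unfixed / previous
        previous = unfixed
    return ratio


if __name__ == "__main__":
    N = 10
    loss_per_generation = 1 - unfixed_decay_ratio(N)
    # Total decay of the unfixed classes; fixation and loss share it equally.
    print(loss_per_generation, 2 * fixation_rate_equal_sexes(N))
    print(fixation_rate_two_sexes(50, 50), fixation_rate_equal_sexes(100))
```

The unfixed classes decline by 1/2N per generation in this model, which by symmetry corresponds to fixation and loss each at rate 1/4N.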

With respect to genes which are indifferent to selection, the mean fre- 
quency is always q=v/(u+v). The variance of characters, dependent on 
such genes, is proportional (at equilibrium) to population number up to 
about N=1/4u. Beyond this, there is approach of variance to a limiting 
value. 

In the presence of selection (s considerably greater than 2u) the mean 
frequency at equilibrium varies between approximate fixation of the 
favored genes (q=1— u/s) in large populations and approximate, if not 
complete, fixation of mutant allelomorphs (q = v/(u+v)) in small
populations, the rate of change from one state to the other being the mutation rate
(u). A consequence is a slow but increasing tendency to decline in vigor 
in inbred stocks, to be distinguished from the relatively rapid but soon 
completed fixation process, described above as occurring at rate 1/2N. 
The variance of characters in this as in the preceding case, is ap- 
proximately proportional to population number up to a certain point 
(N less than 1/4s) and above this rapidly approaches a limiting value. 
Variance is inversely proportional to the severity of selection in large 
populations unless the selection is very slight but in small populations is 
little affected by selection unless the latter is very severe (s greater than 
1/4N). 

Evolution as a process of cumulative change depends on a proper bal- 
ance of the conditions, which, at each level of organization—gene, chro- 
mosome, cell, individual, local race—make for genetic homogeneity or 
genetic heterogeneity of the species. While the basic factor of change—the 
infrequent, fortuitous, usually more or less injurious gene mutations, in 
themselves, appear to furnish an inadequate basis for evolution, the 
mechanism of cell division, with its occasional aberrations, and of nuclear 
fusion (at fertilization) followed at some time by reduction make it possible 
for a relatively small number of not too injurious mutations to provide an
extensive field of actual variations. The type and rate of evolution in 
such a system depend on the balance among the evolutionary pressures 
considered here. In too small a population (1/4N much greater than u and 
s) there is nearly complete fixation, little variation, little effect of selection 
and thus a static condition modified occasionally by chance fixation of new 
mutations leading inevitably to degeneration and extinction. In too large 
a freely interbreeding population (1/4N much less than u and s) there is 
great variability but such a close approach to complete equilibrium of all 
gene frequencies that there is no evolution under static conditions. Change 
in conditions such as more severe selection, merely shifts all gene frequen- 
cies and for the most part reversibly, to new equilibrium points in which 
the population remains static as long as the new conditions persist. Such 
evolutionary change as occurs is an extremely slow adaptive process. 
In a population of intermediate size (1/4N of the order of u) there is con- 
tinual random shifting of gene frequencies and a consequent shifting of 
selection coefficients which leads to a relatively rapid, continuing, irre- 
versible, and largely fortuitous, but not degenerative series of changes, 
even under static conditions. The rate is rapid only in comparison with 
the preceding cases, however, being limited by mutation pressure and thus 
requiring periods of the order of 100,000 generations for important changes. 
Finally in a large population, divided and subdivided into partially isolated
local races of small size, there is a continually shifting differentiation 
among the latter (intensified by local differences in selection but occurring 
under uniform and static conditions) which inevitably brings about an 
indefinitely continuing, irreversible, adaptive, and much more rapid evolu- 
tion of the species. Complete isolation in this case, and more slowly in the 
preceding, originates new species differing for the most part in nonadaptive 
respects but is capable of initiating an adaptive radiation as well as of 
parallel orthogenetic lines, in accordance with the conditions. It is
suggested, in conclusion, that the differing statistical situations to be ex-
pected among natural species are adequate to account for the different 
sorts of evolutionary processes which have been described, and that, in 
particular, conditions in nature are often such as to bring about the state 
of poise among opposing tendencies on which an indefinitely continuing 
evolutionary process depends. 
INTRODUCTION


In biology and in medicine the finest instruments and the purest chemi- 
cals are used for experimental work, but often the laboratory animals are 
of very impure stock, genetically speaking. In future this will probably 
change, because accuracy of work is increasingly endangered by the use of 
crude living stock. In order to prepare the way in some respects for the use 
of better stock the present investigation was undertaken. 

The two animals to be compared are Mus musculus L., the European
house mouse which has developed many tame varieties, primarily the 
commonly used albino mouse, and Mus wagneri (EVERSMANN 1848). Mus 
wagneri is, according to SCLATER (1890), the house mouse of all Central 
Asia. As Mus wagneri mongolium, Mus wagneri manchu or Mus gansuensis 
(Kansu is a province of China) it lives in Mongolia, Manchuria and China 
proper, and I have found it to be the house mouse in Peking. The Japanese 
waltzing mouse is a variety of this species as I was able to demonstrate 
long ago (FORTUYN 1912) and as has been proved in detail by GATES
(1926). In the local market in Peking tame varieties of Mus wagneri in 
several colors may be bought, and some of these have found their way to 
Europe and America. On more than one occasion a group of wild Mus 
wagneri has been imported into the United States and has been crossed 
with varieties of Mus musculus. For many years Mus musculus has also 
been crossed with the Japanese waltzing mouse. As it is probable that in 
the laboratories of the world Mus musculus and Mus wagneri will become 
ever more mixed, it was thought to be worth while to study the differences 
between the two species. In this first paper they are compared only in the 
number of their tailrings. Recently, the name Mus wagneri for the Chinese 
house mouse has been replaced by ALLEN (1927) by the name Mus bac- 
trianus, Blyth. It seems to me, in view of the already frequent use of the 
name Mus wagneri, especially in genetical literature, that it ought to be 
retained. 
MATERIAL


The material comprises five groups of Mus musculus and three groups of 
Mus wagneri. 

Since the wild Mus musculus is not obtainable in Peking (it is said to 
occur only along the coast of China, imported by ships) I can only refer 
to the small group of 18 mice which I studied in Holland (FORTUYN 1912).

The other four groups of Mus musculus are all tame albino mice. The 
animal house of the PEKING UNION MEDICAL COLLEGE provided me with
400 mice, all of them descendants of 32 animals imported from the
ROCKEFELLER INSTITUTE, Princeton, New Jersey, by Doctor C. TENBROECK in
1924. They are indicated here by the general name of “albino Mus
musculus.” By selection, as described in a previous article (FORTUYN 1928) two
races were derived from this colony, one with a high number of tailrings, 
called H T M, and one with a low number of tailrings, called L T M. 

The H T M-mice used for the present investigation were all descendants 
of three mice with 212, 220 and 230 tailrings respectively. As two of them 
were brother and sister, they had only four parents with 212, 214, 218 and 
222 tailrings and five grandparents three of which had 205, 222 and 225 
tailrings, while in two of them the number of tailrings was unknown. 

The L T M race started with four mice which after death appeared to 
have 152, 154, 178 and 196 tailrings. Of course, the selection of the last 
one was a mistake, but in the living animal the number of tailrings had 
been thought to be much lower. After two generations four mice from 
among all the descendants of the original four were again selected, this 
time not for the number of tailrings which they happened to have them- 
selves, but for the average number of tailrings of their young. After death 
these four mice appeared to have 148, 152, 157 and 171 tailrings. Their 
descendants are the L T M-mice studied in this paper. 

The fourth group of albino mice of musculus type has a totally different 
origin. In the local market in Peking a group of tame mice of wagneri type 
was bought which produced in time a number of color-varieties of both 
wagneri and musculus type. In this paper only three out of a great number 
of races from this source will be described. One of these is an albino mouse 
of musculus type, called W M, another a spotted black-and-white mouse 
of wagneri type, called B W T W, and the third an albino mouse of wagneri 
type, called W T W. Race W M takes its origin from eight mice, six white 
and two black, chosen for having the type musculus. This choice was very 
superficially made, the animals being selected simply because they had the 
large size of Mus musculus, yet all the descendants were musculus in type. 
The wagneri type did not appear among them, not even when some of the
smallest animals were mated together. Of course, in order to get the race 
W M, the black descendants which appeared in the beginning were dis- 
carded. 

From the beginning the black-and-white mice were all of wagneri type 
and I failed to produce a black and white Mus musculus even by breeding 
from the heaviest individuals of B W T W. 

With W T W it was different. Again and again animals selected for 
wagneri type had offspring which clearly belonged to musculus. Finally 
after some generations and after selection based not only upon the type 
of the animals themselves, but also upon the type of their offspring, a race 
with nothing but wagneri mice was obtained. 

The third group of wagneri mice is a group of wild house mice caught in 
Peking. 

METHOD 


In order to determine the number of tailrings of a mouse they were 
counted by means of a hand lens after the death of the animal. On account 
of irregularities in the tailrings the number is never absolutely accurate. It 
may be two more or less than the figure actually noted. In connection with 
the selection of the races H T M and L T M the number of tailrings was 
also counted in the living mouse and sometimes after death was found to be 
incorrect. After the age of five weeks, however, there is no increase in the 
number of tailrings, which is proved for H T M where the tailrings were 
counted in groups of ten mice each of both sexes and of light, medium and 
heavy weight. These mice were from five to fifteen weeks and from four to 
six months old and yet no increase in the number of tailrings was found. 
In mice younger than five weeks the tailrings were never counted. In race 
H T M a study was also made whether there is a sexual difference in the
number of tailrings. As 136 male mice had an average of 216.57 ± 0.85
tailrings and 154 female mice one of 215.33 ± 0.73 the difference between
the two sexes (1.64 ± 1.12) is not significant.


DISCUSSION 


Table 1 gives the number of tailrings counted in eight different groups
of mice, giving for each group the total number of mice (n), the average
number of tailrings and its mean or standard error, the standard deviation
of the number of tailrings (σ) and the coefficient of variation (v).
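
The quantities listed for each group are related in a simple way. The sketch below assumes the usual definitions, which the paper does not spell out: the mean error is taken as σ divided by the square root of n, and the coefficient of variation as 100σ divided by the average. With the figures reported in table 1 for H T M and for the wild Mus musculus these definitions reproduce the tabulated values. The function names are hypothetical.

```python
import math


def standard_error(sigma, n):
    """Mean (standard) error of an average: sigma / sqrt(n)."""
    return sigma / math.sqrt(n)


def coefficient_of_variation(sigma, mean):
    """Coefficient of variation in percent: 100 * sigma / mean."""
    return 100.0 * sigma / mean


if __name__ == "__main__":
    # H T M in table 1: n = 290, average 216.25, sigma 9.61.
    print(round(standard_error(9.61, 290), 2))
    print(round(coefficient_of_variation(9.61, 216.25), 2))
    # Wild Mus musculus in table 1: n = 18, average 197.01, sigma 10.90.
    print(round(standard_error(10.90, 18), 2))
    print(round(coefficient_of_variation(10.90, 197.01), 2))
```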

In comparing the wild Mus musculus with the albino Mus musculus in 
the colony in the animal house, there appears to be no important difference 
in the average number of tailrings. The difference is 7.36 ± 2.64 and, as this
is less than three times its mean error, it is not significant. BLASIUS (1857)
and others mention 180 tailrings in the wild Mus musculus which probably 
is too low to be the average. 

TABLE 1
Variation of the number of tailrings.

TAILRINGS   WILD      ALBINO    H T M     L T M     W M       B W T W   W T W     WILD
(CLASS      Mus       Mus                                                         Mus
CENTERS)    musculus  musculus                                                    wagneri

118         -         -         -         -         -         -         -         3
123         -         -         -         2         -         2         1         5
128         -         -         -         6         -         6         -         11
133         -         -         -         10        -         24        1         22
138         -         -         -         27        -         18        7         22
143         -         1         -         26        -         12        8         24
148         -         1         -         22        -         17        27        17
153         -         3         -         13        3         21        31        7
158         -         7         -         4         3         18        29        1
163         -         7         -         1         6         6         21        1
168         -         15        -         -         1         6         25        -
173         -         19        -         -         5         8         13        -
178         2         26        -         -         14        3         8         -
183         1         55        -         -         15        4         6         -
188         3         71        1         -         30        4         1         -
193         1         66        8         -         34        -         -         -
198         3         50        9         -         37        -         1         -
203         3         50        19        -         32        -         -         -
208         3         20        40        -         20        -         -         -
213         2         4         55        -         15        -         -         -
218         -         5         60        -         11        -         -         -
223         -         -         44        -         4         -         -         -
228         -         -         40        -         -         -         -         -
233         -         -         10        -         -         -         -         -
238         -         -         3         -         -         -         -         -
243         -         -         1         -         -         -         -         -

n           18        400       290       111       230       149       179       113
Average     197.01    189.38    216.25    143.19    195.12    150.10    159.25    138.49
            ±2.57     ±0.62     ±0.56     ±0.76     ±0.93     ±1.26     ±0.87     ±0.82
σ           ±10.90    ±12.57    ±9.61     ±8.05     ±14.01    ±15.46    ±11.76    ±8.77
v           5.53      6.63      4.44      5.62      7.18      10.29     7.38      6.33

It is obvious from this table that it has been possible to select from the
albino mice of the animal house a race H T M with a high number of
tailrings (average 216.25 ± 0.56) and a race L T M with a low number of
tailrings (average 143.19 ± 0.76). There is a wide gap between the high
extreme of L T M (164) and the low extreme of H T M (187). Whether
the colony of albino mice contains more of such races has not been studied. 
As yet, there are no indications that either H T M or L T M may be 
further purified in regard to the number of tailrings, but this has not been 
studied systematically. The coefficient of variation, especially of H T M, 
is low. The hybridisation of the two races is a point which is left to the 
future. 

The wild Mus wagneri has a much lower number of tailrings than the 
wild Mus musculus. The average of 138 found by me closely corresponds 
to the number of about 130 given in the diagnosis by EVERSMANN (1848).
In view of the coefficient of variation, 6.33, which is nearly the same as in 
the unselected albino mouse (6.63), it seems probable that this wild species 
contains two or more races in regard to the number of tailrings. The 
striking diversity in hair-color in the wild Mus wagneri also suggests 
genetical differences. It is interesting to note that the number of tailrings 
in L T M, a pure Mus musculus even of a very heavy type, is nearly the 
same as in the wild Mus wagneri, which on the average has about half 
its body weight. The extremes are nearly coincident and the averages 
do not differ much, although the difference is significant (4.70 ± 1.11).

On the other hand the race W M, derived from hybrids of wagneri 
and musculus and selected for musculus type, but not for the number of 
tailrings shows little difference from the albino musculus of the animal 
house, although here again the difference has significance (5.74 ± 1.11).
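
The two comparisons above rest on the criterion used earlier in the discussion, that a difference counts as significant when it exceeds three times its mean error. A minimal sketch of that test follows. It assumes that the mean error of a difference is obtained by combining the two mean errors in quadrature, which the paper does not state explicitly; with the figures of table 1 this gives about 1.12 where the text prints 1.11. The function names are hypothetical.

```python
import math


def difference_with_error(mean_a, se_a, mean_b, se_b):
    """Difference between two averages and its mean error,
    the latter taken as sqrt(se_a**2 + se_b**2) (an assumption)."""
    diff = abs(mean_a - mean_b)
    se = math.sqrt(se_a ** 2 + se_b ** 2)
    return diff, se


def is_significant(diff, se, factor=3.0):
    """Criterion of the text: the difference exceeds three times its mean error."""
    return diff > factor * se


if __name__ == "__main__":
    # L T M against the wild Mus wagneri (averages and mean errors from table 1).
    d, e = difference_with_error(143.19, 0.76, 138.49, 0.82)
    print(round(d, 2), round(e, 2), is_significant(d, e))
    # W M against the albino Mus musculus of the animal house.
    d, e = difference_with_error(195.12, 0.93, 189.38, 0.62)
    print(round(d, 2), round(e, 2), is_significant(d, e))
```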

In B W T W we have a race selected for wagneri type, but not for the 
number of tailrings, which in regard to the latter is intermediate between 
wagneri and musculus. The high coefficient of variation (10.29) suggests 
that it contains several races with different numbers of tailrings. 

W T W, the tame albino Mus wagneri, selected for wagneri type, but 
not for tailrings, has an average number of tailrings which is somewhat 
higher even than the number of tailrings in the black-and-white variety. 
Variation is less than in B W T W, but still great enough to justify the 
assumption that W T W also contains different races in regard to the 
number of tailrings. 

From table 1 and especially from the fact that L T M is a very heavy 
race, often reaching a weight of over 40 grams, it may be concluded that 
mice of musculus type may have any number of tailrings proper either to 
Mus musculus or to Mus wagneri. On the other hand, mice of wagneri 
type are not known to have as high numbers of tailrings as musculus may 
have. In this respect there is a certain correlation between the number of
tailrings and body-size. 

The correlation between the number of tailrings and the tail length 
has been worked out in the case of the unselected albino Mus musculus 
of the animal house. In 340 out of the total number of 400 mice in which 
the number of tailrings was counted the length of the tail was measured 
in millimeters. To simplify the computations for this correlation the mice 
were grouped in classes differing by five tailrings and by three millimeters 
in tail length. Care was taken to have the average of each character in 
the centre of a class. In this way the following correlation table was 
obtained (table 2). The coefficient of correlation, r, becomes 0.27 ± 0.05
and is therefore low. This result agrees with the fact that after an age of
five weeks the number of tailrings does not increase, whereas the tail still
grows and in its growth is affected by the temperature of the environment
as shown by SUMNER (1909). It also proves that in general fewer rings
mean broader rings. Indeed, it is possible to distinguish between the mice
of H T M and L T M by merely noting whether their tailrings are narrow
or broad, without counting their number.


TABLE 2
Correlation between the number of tailrings and the tail length in the albino Mus
musculus.

[Body of table not recoverable. Columns: number of tailrings (class centers 144 to 219, in steps of 5). Rows: tail length in mm (class centers, in steps of 3). Total: 340 mice. r = 0.27 ± 0.05.]

The correlation between the number of tailrings and the number of 
tail vertebrae has been investigated in the unselected tame albino Mus
wagneri, W T W. According to GATES (1926) the number of tail vertebrae
in Mus musculus is 26-28, in Mus wagneri 19-20 and in the waltzing
mouse, which is probably more nearly like W T W in this respect than the
others, it is 21-22. I found in 129 mice of the race W T W 22-26 tail
vertebrae with an average of 23.76 ± 0.07.

Within this group the correlation between the number of tailrings and 
the number of tail vertebrae may be seen from table 3. Here again, to 
reduce the computations, the mice were grouped in classes differing by
five tailrings, care being taken to have the average of the number of
tailrings in the centre of a class. The tail vertebrae were counted by means of
a hand lens after the tail of the mouse had been cut off through the root
and skinned. It is often not easy to count the very small and poorly
developed vertebrae in the tip of the tail. The coefficient of correlation
between the number of tailrings and the number of tail vertebrae appears
to be r = 0.43 ± 0.07. This is high enough to suggest that selection for
the number of tail vertebrae may lead to a reduction in the variability
of the number of tailrings.


TABLE 3
Correlation between the number of tailrings and the number of tail vertebrae in the
tame albino Mus wagneri, W T W.

[Body of table not recoverable. Columns: number of tailrings (class centers 125 to 195, in steps of 5). Rows: number of tail vertebrae. r = 0.43 ± 0.07.]
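
The computation of a coefficient of correlation from a table of grouped classes, and of its mean error, can be sketched as follows. Two assumptions are made here that the paper does not state: r is the ordinary product-moment coefficient computed from the class centers, and its mean error is (1 - r²) divided by the square root of n. The second assumption reproduces the errors quoted in the text for r = 0.27 with 340 mice and r = 0.43 with 129 mice. The small table in the example is hypothetical, since the original table bodies are not available.

```python
import math


def grouped_correlation(x_centers, y_centers, counts):
    """Product-moment correlation from a grouped table.

    counts[i][j] is the number of animals in row class y_centers[i]
    and column class x_centers[j].
    """
    n = sum(sum(row) for row in counts)
    sx = sy = sxx = syy = sxy = 0.0
    for i, y in enumerate(y_centers):
        for j, x in enumerate(x_centers):
            f = counts[i][j]
            sx += f * x
            sy += f * y
            sxx += f * x * x
            syy += f * y * y
            sxy += f * x * y
    cov = sxy / n - (sx / n) * (sy / n)
    var_x = sxx / n - (sx / n) ** 2
    var_y = syy / n - (sy / n) ** 2
    return cov / math.sqrt(var_x * var_y)


def correlation_error(r, n):
    """Mean error of r, taken as (1 - r**2) / sqrt(n) (an assumption)."""
    return (1 - r * r) / math.sqrt(n)


if __name__ == "__main__":
    # Hypothetical 3 x 3 table with every animal on the diagonal.
    toy = [[2, 0, 0], [0, 5, 0], [0, 0, 3]]
    print(grouped_correlation([150, 155, 160], [22, 23, 24], toy))
    # Errors for the two coefficients reported in the text.
    print(round(correlation_error(0.27, 340), 2))
    print(round(correlation_error(0.43, 129), 2))
```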


SUMMARY 


In eight different groups of mice, some belonging to the species Mus 
musculus, others to Mus wagneri, the number of tailrings was studied. 

In the albino Mus musculus in the animal house of PEKING UNION
MEDICAL COLLEGE, imported from the United States of America, the 
number of tailrings was practically the same as in the wild Mus musculus 
of Holland. 
By selection from the mice in the animal house two races were obtained,
one with a high number of tailrings and one with a low number. 

The race with the low number of tailrings had approximately the same 
number as the wild Mus wagneri, although its body weight was nearly 
twice as much. 

From tame hybrids of Mus wagneri and Mus musculus, bought in the 
local market in Peking, three groups of mice were obtained. One group,
albino mice of musculus type, had practically the same number of
tailrings as the albino Mus musculus of the animal house. Another group of
albino mice and also a group of black-and-white mice, both of wagneri 
type, had a number of tailrings intermediate between the typical Mus 
musculus and Mus wagneri.

The number of tailrings showed no sexual difference. It did not increase 
after the age of five weeks, although the tail itself continued to grow for a 
long time. In connection with this the coefficient of correlation between 
the number of tailrings and the tail length is low. In general, therefore, 
fewer rings mean broader rings. The coefficient of correlation between 
the number of tailrings and the number of tail vertebrae is rather high. 
In order to compare the body weight of Mus musculus and Mus wagneri
in three of the races studied for the number of their tailrings and described 
in the first paper of this series the weight was determined from birth to 
the age of six months. On the first, the eighth and the fifteenth day the 
weight was measured in tenths of grams, for animals of 3 weeks in half 
grams and for older animals of 4 to 15 weeks and of 4 to 6 months, in 
grams. No pregnant females were used. 

The three races used for this investigation were (a) the tame albino 
Mus wagneri, W T W, (b) the tame albino Mus musculus, W M, both 
derived from hybrids of wagneri and musculus, bought in the local market 
in Peking, but carefully selected, the one for wagneri type, the other for 
the type of Mus musculus, and (c) the albino Mus musculus, H T M. 
The latter is a pure Mus musculus imported from the United States of 
America, but selected for the high number of tailrings. Tables 1, 2, and 3 
contain the data collected about these races, and figure 1 gives a graphic 
representation of them. 

Only a few observations were made on the body weight of the wild 
Mus wagneri, caught in houses in Peking, since the age of such animals 
is unknown and the data, therefore, are not directly comparable with those 
obtained in tame races. In nine wild males, apparently not very young, 
the weight varied between 10 and 16 grams, in six non-pregnant females 
it varied between 9 and 15 grams, whereas two pregnant females weighed 
14 and 20 grams respectively. These figures agree very well with those of 
table 1 for W T W. 

The three tables offer many interesting points. In the albino Mus 
wagneri the male is heavier than the female as is the case in all other 
varieties of mice. Mus wagneri is at every age, on the average, lighter 
than Mus musculus. This is a well known fact, but it is here studied in 
detail. Those animals which become very heavy do so by accumulating 
fat. They differ in general appearance from Mus musculus of the same 
weight. In W T W there is a great variation in body weight, the heaviest 
animal being three or even four times as heavy as the lightest animal of 
the same age. This fact finds its expression in a high coefficient of
variation.

Table 2 for Mus musculus, W M, gives the weight for males and females
separately only after the age of six weeks, as at the time when these data
were collected the sexes were not separated before the age of seven weeks,
which is too late. The table also contains the figures for the weight of
animals of both sexes together after the age of seven weeks, but these
have little significance on account of the marked sexual difference in body
weight. At all ages the heaviest animal is only about twice as heavy as
the lightest animal of the same age, and, consequently, the coefficient of
variation is lower than in W T W. It is slightly higher in the female than
in the male.

FIGURE 1. Growth curves for the albino Mus wagneri, W T W; the albino Mus musculus,
W M; and for the albino Mus musculus selected for a high number of tailrings, H T M. After the
age of 4 weeks (in W M 6 weeks) the growth curve for both sexes is given.

Table 3 for the albino Mus musculus, H T M, which was imported into 
China and probably had no Mus wagneri among its ancestors gives the 
body weight for the age of 14 and 15 weeks, whereas in the other two 
races the weight at 13 weeks was followed immediately by the weight at 
4 months. This was done on account of the fact that in both sexes the 
growth curve went down during the thirteenth week after which the body 
weight slowly rose again to reach what is apparently its normal figure at 
the end of the fourth month. This irregularity in the growth curve is 
partly caused by the fact that the group of mice weighed when 12 weeks 
old was not identical with the group weighed at the age of 13 weeks. In 
fact, in all three races very few mice were weighed from the beginning to 
the end of the growth curve. Nevertheless, the growth curves are fairly 
smooth on account of the large number of mice weighed. Only in race 
W M before the age of 13 weeks and in race H T M after the age of 12 
weeks some irregularities occur which must be studied more carefully to 
be completely understood. In many cases also the individual growth 
curves of the mice show more than the usual irregularities during these 
weeks. 

Another remarkable fact is the low degree of variation in the body 
weight of H T M, especially in the male sex. In all cases the heaviest male 
is only one and a half times as heavy as the lightest male of the same age. 
The coefficient of variation for the male is generally below 10. In H T M 
more than in the other two races it is obvious that the male is less variable 
than the female in regard to body weight, although, as remarked before, 
pregnant females were not included in the investigation. In the albino rat 
the reverse is true. 

Since the low degree of variability in body weight cannot be ascribed 
to inbreeding, which was practised irregularly and on about the same scale 
in all three races, it must be a consequence of the selection for a high num- 
ber of tailrings which had been practised to obtain the race H T M. The 
coefficient of variation for the number of tailrings, as given in the first 
paper of this series, is highest for W T W and lowest for H T M, whereas 
W M has an intermediate position. Exactly the same holds true for the 
coefficient of variation of the body weight. The correlation between the 
number of tailrings and the body weight must be studied in greater detail, 
but so far it seems as if reduction of the variability of the number of 
tailrings, by selection, goes parallel with a reduction of the variability in 
body weight. It is much more difficult to select the body weight than the 
number of tailrings, because the body weight changes from day to day 
during the whole life of the individual, whereas the number of tailrings is 
constant after the age of 5 weeks. One has, therefore, in selection for the 
number of tailrings a valuable means of indirect and provisional selection 
for body weight which must be of importance in all those cases where the 
body weight of mice is studied experimentally. 

When the three races of mice are compared it appears that there is very 
little difference between W M and H T M, proving that W M, although 
of hybrid origin, is a true Mus musculus. On the other hand, the average 
body weight of the tame Mus wagneri, W T W, at all ages and for both 
sexes is less than the body weight of Mus musculus. Generally speaking 
one may say that the body weight of Mus wagneri is two-thirds of that of 
Mus musculus. 

SUMMARY 

The average body weight of Mus wagneri is two-thirds of that of Mus 
musculus. 

In both species of mice the male is heavier than the female. 

There is little difference in body weight between a pure albino Mus 
musculus imported from the United States of America into China and an 
albino Mus musculus, derived from hybrids with Mus wagneri, bought in 
Peking. 

In Mus musculus the coefficient of variation of the body weight is 
higher in the female than in the male. 

The coefficient of variation of the body weight is highest in the albino 
Mus wagneri, lower in the albino Mus musculus of hybrid origin and still 
lower in the albino Mus musculus selected for a high number of tailrings. 


It goes parallel with the coefficient of variation of the number of tailrings 
in these races. 

INTRODUCTION 

Genetic investigations with Zea mays have established ten linkage 
groups. Likewise, cytological investigations have revealed the presence 
of ten morphologically identifiable chromosomes composing the haploid 
complement (McClintock 1929b). It is the aim of the present investigation 
to correlate a particular linkage group with a particular chromosome. 

The method employed has been to obtain 2n+1 plants trisomic for 
different members of the haploid complement, and then, by means of 
trisomic inheritance, to determine which chromosome carries a particular 
group of genes. 

The 2n+1 plants have been obtained from the progeny of one original 
triploid (McClintock 1929a). The chromosome number in the female 
gametes of a triploid varies from ten to twenty. Trisomic individuals were 
obtained directly from the F1 of a cross triploid × diploid and from the 2n+1 
progenies of F1 individuals with more than one extra chromosome. 

Inheritance data obtained from the 2n+1 plants of culture 131 sug- 
gested that in this culture the r-g carrying chromosome was present in 
triplicate. An effort was therefore made to test the validity of this inter- 
pretation and to verify the genetic inference that the nine other linkage 
groups are independent of the r-g chromosome. 

The evidence indicates that the smallest chromosome of the haploid 
complement carries the genes of the r-g linkage group. Thus, 2n+1 plants 
of culture 131 showing trisomic inheritance for r have the smallest 
chromosome in duplicate in their 11-chromosome microspores. Likewise, 2n+1 
plants of other cultures which show the smallest chromosome in duplicate 
in 11-chromosome microspores have given trisomic inheritance for r in 
later genetic investigations. 


METHODS 


Root-tips were fixed in a chromic-acetic-formalin mixture and sectioned 
in paraffin. The aceto-carmine smear method was used for sporocytes 
which previously had been fixed in an acetic-absolute alcohol mixture. 
The microspores were fixed in an acetic-absolute alcohol mixture, stained 
with carmine and cleared with chloral hydrate. 

For morphological studies of the chromosomes, late prophase stages of 
the division of the microspore nucleus were found most valuable, since 
the chromosomes at this stage are longer, their constrictions are more 
obvious and the relative length of their arms is more readily determined 
than in the contracted metaphase stage. From comparative studies of the 
most easily distinguishable chromosomes of the complement it is clear 
that the morphology as shown by the prophase and metaphase microspore 
figures is essentially similar to that shown in the root tips. The presence 
of only the haploid complement and the ease of observation in the micro- 
spore favored the use of this stage for cytological studies. 


OBSERVATIONS ON TRISOMIC INHERITANCE FOR R 


As has been stated, the 2n+1 individuals of culture 131 were trisomic 
for the r-g chromosome. This culture arose from a 2n+1 plant which had 
been selfed. Of 61 individuals examined, 39 (or 63.9 percent) were 2n, 
21 (or 34.4 percent) were 2n+1 and 1 (or 1.6 percent) was 2n+2. It can 
be safely assumed that all the 2n+1 plants of this culture were trisomic 
for the same chromosome, since the normal rate of non-disjunction in 
Zea mays is very low. 

As a result of the distribution of the members of the trivalent at 
meiosis, 11-chromosome carrying and 10-chromosome carrying gametes 
are formed. In a normal pollination, the extra chromosome carrying pollen 
grains do not function well in competition with the n-carrying pollen grains 
(see table 1). For genetic investigations, therefore, it is necessary to 

TABLE 1 
Percent of 2n+1 individuals resulting from the cross 2n ♀ × 2n+1 ♂. 

2n ♀ × 2n+1 ♂        2n     2n+1     Percent 2n+1 
192, X 224;          261       3         1.14 
1923 X 2253           83       2         2.35 

consider the functioning of 11-chromosome carrying gametes only in 
the case of the female. On the basis of random distribution of the three 
similar chromosomes at meiosis one should expect half of the eggs to 
carry the extra chromosome, but actually only about one-third of the 
eggs carried it (see above and table 2). This discrepancy can be explained 
on the basis of irregularities at meiosis. Nine bivalents and one tri- 
valent are found at metaphase I only in approximately two-thirds of the 
sporocytes; in the other sporocytes there are ten bivalents and one uni- 
valent. When the extra chromosome appears thus as a univalent its 


TABLE 2 
Number 2n+1 : 2n individuals from the cross 2n+1 ♀ × 2n ♂. 

CULTURE       2n     2n+1 
166           11       4 
168            1       1 
176            5       2 
224            2       2 
225           14       5 
229           10      10 
230           14       4 
231           17       2 
232            9      11 
Totals        83      41     (33.06 percent 2n+1) 

behavior is very irregular (McClintock 1929a). It may not go into the 
spindle figure but remain in the cytoplasm. It may be found in an ab- 
normal position in the spindle. Again, the univalent may lag in the central 
part of the spindle with or without showing evidence of a separation of its 
split halves. If the halves should be included in the two telophase I 
nuclei, they would probably lag in the second meiotic mitosis. In all of 
these cases a loss of the univalent will occur in meiosis, with the formation 
of all n-carrying nuclei instead of half n-carrying and half n+1. This 
phenomenon could account for the increased ratio of n to n+1 gametes. 
The difference in many cases is probably not due to lack of viability of 
n+1 gametes or 2n+1 plants, since in many ears of Zea mays the regular- 
ity of row and kernel position allows undeveloped kernels to be readily de- 
tected. Some 2n+1 plants bore almost perfectly filled ears. It is possible, 
also, that the lowest megaspore, if it contains the extra chromosome, does 
not function to produce the embryosac but, in a certain percent of the 
cases, one of the megaspores that contains the haploid complement 
functions instead. 

The trisomic individuals of culture 131 were crossed so that their prog- 
enies were heterozygous for at least one factor of every linkage group. 
Backcross and F2 ratios were obtained to determine which factors were 
inherited on a trisomic and which on a disomic basis. Abundant evidence 
for trisomic inheritance of r in the 2n+1 progenies of culture 131 (cultures 
189, 209, 224, 225, 229, 231) was obtained. 

Simple ratios may best be considered first. When the R factor for red 
aleurone is duplex (RRr) the gametic ratio expected from random 
distribution of the extra chromosome is 2R:2Rr:1RR:1r, or a total of 5R:1r. 
Since only the n-carrying pollen grains need be considered, the functioning 
male gametic ratio is 2R:1r. In the 2n+1 progenies of culture 131 duplex 
for R the backcross ratios through the pollen were 646R:355r (table 3), 

















TABLE 3 
r ♀ × RRr ♂          COLORED     COLORLESS 
192.225;               240          91 
1925 X 20949           198         141 
19219 X 224,           208         123 
Totals                 646         355 





a fair approximation to 2:1. On the same basis, crosses of 2n Rr ♀ × 2n+1 
RRr ♂ should give 5R:1r. Table 4 shows a total count of 2102R:435r. 
A sib cross between two heterozygous 2n individuals gave 290R:88r, or 
the expected 3:1 ratio (table 7). 
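
The duplex expectations quoted above can be checked by enumerating the gametes of a trisomic plant. The following is a minimal sketch, not part of the original analysis: it assumes that the three homologues assort at random, two to one pole and one to the other, and that only n-carrying pollen functions. The function names are illustrative.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations

def trisomic_gametes(genotype):
    """Enumerate gametes from random 2:1 assortment of three homologues.

    Each of the three possible splits is taken as equally likely and
    contributes one n gamete and one n+1 gamete (an assumption).
    """
    counts = Counter()
    for pair in combinations(range(3), 2):
        single = ({0, 1, 2} - set(pair)).pop()
        counts[genotype[single]] += 1                            # n gamete
        counts["".join(sorted(genotype[i] for i in pair))] += 1  # n+1 gamete
    return counts

def recessive_fraction(genotype, n_only):
    """Fraction of functioning gametes that carry only the recessive r."""
    gametes = trisomic_gametes(genotype)
    if n_only:  # pollen: n+1 grains are assumed not to function
        gametes = Counter({g: c for g, c in gametes.items() if len(g) == 1})
    total = sum(gametes.values())
    recessive = sum(c for g, c in gametes.items() if set(g) == {"r"})
    return Fraction(recessive, total)

print(dict(trisomic_gametes("RRr")))           # 2R : 2Rr : 1RR : 1r
print(recessive_fraction("RRr", n_only=True))  # 1/3, i.e. 2R:1r through pollen
print(recessive_fraction("RRr", n_only=False)) # 1/6, i.e. 5R:1r
```

Multiplying the pollen value (1/3) by the 1/2 recessive gametes of a 2n Rr female gives 1/6 colorless, the 5R:1r expectation compared with table 4.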


























TABLE 4 
Rr ♀ × RRr ♂         COLORED     COLORLESS 
22511 X 224s           265          74 
22514224,              235          49 
2317X 231,             382          61 
23114 X 2314           419          82 
23116 X 2319           357          82 
23117 X 2314           444          87 
Totals                2102         435 








When R is simplex (Rrr) the expected gametic ratio is 1R:2Rr:1rr:2r. 
With elimination of the n+1 carrying pollen grains the functioning male 
gametic ratio is 1R:2r. In the progenies of culture 131 only one individual 
tested was so constituted and gave a backcross ratio through the pollen 
of 110R:234r (table 5). Conversely a 2:1 ratio is expected in crossing a 
2n Rr with this 2n+1 of constitution Rrr; actual counts showed 348R:182r 
(table 5). 


TABLE 5 
Crosses involving the simplex (Rrr) individual, 2245. 

                          COLORED     COLORLESS 
r ♀ × Rrr ♂ 
192,224,                    110         234 

Rr ♀ × Rrr ♂ 
2244x224;                   182          97 
2244224; second ear         166          85 
Totals                      348         182 

Rrr ♀ × Rr ♂ 
224; 224,                   160          57 














The extra chromosome having thus been shown to carry a factor for 
the r-g linkage group, cytological examinations were made in order to 
determine which of the ten chromosomes of the haploid complement it 
was. Since the ten chromosomes are all morphologically distinguishable, 
it was only necessary to examine the 11-chromosome carrying microspores 
and see which chromosome was present in duplicate. Observations on 
diakinesis had already indicated that the chromosome involved was either 
the smallest or the next to the smallest. 

The methods devised at the time of the investigation for the observa- 
tion of the late prophase chromosome made it somewhat difficult to obtain 
figures with all of the chromosomes lying perpendicular to the optical 
axis. Some good figures were obtained, however (figure 1). Many figures 
were found with all but one or two chromosomes lying flat. Since the 
differences in size between the four smallest and the six largest chromo- 
somes are obvious in the later prophase stage almost regardless of the posi- 
tion of the chromosomes in the nucleus, it is comparatively easy to know 
when the four smallest chromosomes are lying flat, and hence to obtain 
accurate figures (figure 2). In the 11-chromosome microspores in which one 
chromosome is present in duplicate it is easy to determine whether it 
belongs to the group of four small chromosomes or to the group of six 
large ones. When it belongs to the former group one observes five small 
instead of four small chromosomes, and the frequency of figures with all 
five chromosomes lying in the desired plane is sufficiently high to make 
accurate comparisons of the chromosomes for the purpose of determining 
which of them is present in duplicate. Such a case is shown in figure 3. 





FIGURE 1.—Late prophase chromosomes in an 11-chromosome (n+1) microspore. The arrows 
indicate the duplicated chromosomes of the haploid complement; these are the r-g carrying 
chromosomes. At this stage the chromosomes are frequently very angular. ×2150. 

FIGURE 2.—The four smallest chromosomes from a normal (n) microspore; late prophase. 
×2150. 

FIGURE 3.—The five smallest chromosomes from an 11-chromosome (n+1) microspore. 
Note that the chromosome in duplicate is the smallest chromosome of the set. Compare with 
figure 2. ×2150. 


It is clear that the extra chromosome which carries the genes for the 
r-g linkage group is the smallest chromosome of the haploid complement. 
The presence in triplicate of the smallest chromosome does not markedly 
affect the growth or appearance of the plant. One is unable to detect 
even a moderate difference in comparing 2n+1 with 2n individuals. On 
the average, the 2n+1 individuals are somewhat weaker than the 2n 
individuals. Since genetic material of Zea mays is very heterozygous for 
growth and morphological characters, it is possible that the morphological 
changes produced by the presence of this extra chromosome would be 
recognizable only in fairly homozygous material. In consequence of the 
inability to recognize 2n+1 plants with certainty in the field, every plant 
must be examined cytologically or tested genetically to detect the presence 
of the extra chromosome. 





FIGURE 4.—Photomicrograph of microspore showing late prophase chromosomes of first 
nuclear division. The smallest chromosome of the complement, that which carries the genes of 
the R-G linkage group, is the second chromosome from the left. It is the only chromosome in focus 
throughout its entire length. ×1800. 


Another 2n+1 plant (144;) not directly related to culture 131, an F1 
individual from the cross triploid × diploid, was examined cytologically. 
The examination showed clearly the presence of the smallest chromosome 
in triplicate. It was therefore necessary to prove that this plant or its 
2n+1 progeny would show trisomic inheritance for r. Colored kernels from 
the cross 144; (2n+1) × ACr (2n) were grown. The 2n and 2n+1 F1 
individuals were backcrossed to r testers (ACr). Pollen of 2n+1 plants 
placed on r testers gave a total count of 1282 colored to 2451 colorless 
kernels, a simple trisomic ratio of 1:2 (table 6). Plant 144, probably had 
the constitution Rrr, since each of the 2n+1 offspring was Rrr. The 2n 
sibs (culture 232, table 7) showed, in contrast, a good disomic 1:1 ratio. 

TABLE 6 
r ♀ × Rrr ♂          COLORED     COLORLESS 
19412 X 2322            99         178 
194142322               33          75 
19434 X 232.4           82         153 
19445 232,              79         184 
194; X 232.             97         140 
194,3X232¢              78         174 
194; X232s              82         167 
194, X232n              69         132 
194, X232n             104         154 
1921223212              89         199 
194, X232i2             98         202 
19416X 23213            52         100 
19419 X 23213           99         195 
193. K 23216            27          55 
1933 K2321¢             79         143 
193, X232i6            115         200 
Totals                1282        2451 

TABLE 7 
Crosses involving 2n Rr individuals of cultures 189, 225, 231 and 232. 

                     COLORED     COLORLESS 
Rr selfed 
22510                  274          99 
231i                   334         105 
Totals                 608         204 

Rr ♀ × Rr ♂ 
22517 X 2256           290          88 

Rr ♀ × rr ♂ 
22512 1924             184         219 
22513 X 1924           177         187 
232; X19415            172         195 
2325 X192,             113         104 
2327 X192¢             221         198 
23214X 192¢            182         168 
2323 X192,              70          73 
23210 19443             20          30 
23213 X 192.            22          22 
Totals                1161        1196 

rr ♀ × Rr ♂ 
192; 189As50           132         135 

Still another unrelated trisomic plant (219A;) was found which possessed 
the smallest chromosome in triplicate. Its genic constitution with respect 
to r was unknown, but its pollen was used on r testers with the thought 
that it might be heterozygous. The total backcross ratio from the three 
ears obtained (table 8) was 295R:131r, indicating a duplex (RRr) 
condition in this plant. 











TABLE 8 
rr ♀ × RRr ♂         COLORED     COLORLESS 
19212. 219A;           192          94 
193, X219A;             15          11 
194,,X219A5             88          26 
Totals                 295         131 





Thus far only ratios through the pollen have been considered. The ratios 
are simple and agree with the expectations. The trisomic ratios as 
expressed through the female resulting from the cross 2n+1 ♀ × 2n ♂ are 
complicated by the presence of an excess of 2n over 2n+1 individuals. 
Possible explanations for this have been given on page 177. As was stated 
there, if there were random assortment of the three homologous chromo- 
somes at meiosis 50 percent of the gametes should be n and 50 percent 
n+1. Actually, only about one-third of the eggs carry the extra chromo- 
some. This condition materially changes the expected ratio. 

In a simplex plant (Rrr) the gametic ratio, on the basis of random 
assortment of the three homologous chromosomes would be 1R:2Rr:1rr:2r 
which would give on backcrossing to rr plants, a phenotypic ratio of 
1R:1r. If, however, lagging and loss of one of the three chromosomes 
occur and at random (or if there is also some selection against n+1 
megaspores in favor of n) the gametic ratio produced as a result of these 
occurrences would be 1R:2r and all the gametes would be n. The total 
gametic ratio, therefore should fluctuate between these two extremes de- 
pending upon the relative amounts of random distribution of the three 
homologous chromosomes as compared to lagging and loss that have 
occurred. It is therefore necessary to know what proportion of the female 
gametes were produced after random assortment and what proportion 
after loss. 

If a 2n+1 simplex plant (Rrr) is pollinated with pollen from a 2n rr 
plant the proportion of R-carrying and r-carrying gametes may be 
directly determined. The amount of lagging, or of lagging plus megaspore 
selection, then, may be computed from the distortion of the 1:1 
ratio (expected with no lagging) in the direction of a 1:2 ratio (expected 
with 100 percent lagging). An approximate measure of the total amount of 
lagging and loss that has occurred during megasporogenesis in the ovules 
of the 2n+1 plants of culture 232 (table 9) has been obtained in this 
manner. 

TABLE 9 
Rrr ♀ × rr ♂         COLORED     COLORLESS 
23241924               137         178 
2326 X 19415           130         149 
2323192.               115         150 
2329 X 192.            137         178 
23216 X 192¢           160         181 
Totals                 679         836 

The 2n+1 plants of culture 232 were all Rrr (see table 6). They 
were pollinated with pollen from 2n r-testers (ACr). Each ear obtained 
showed a deviation from a 1:1 ratio expected with random assortment of 
the three homologous chromosomes toward the 1:2 ratio expected from 
lagging and loss, or from lagging and loss plus megaspore selection. Of the 
1515 kernels obtained 679 were colored (dominant) and 836 were colorless 
(recessive). 

1R:1r=gametic ratio due to random assortment of three homologous 
chromosomes. 

1R:1r:1r =gametic ratio due to lagging and loss. 

When these two gametic ratios are placed one above the other, it is 
seen that 679 represents the number of R-carrying gametes produced by 
both types of distribution, and similarly, 836 the number of r-carrying 
gametes produced. If 679 represents the number of R-carrying gametes 
produced, then 679 can be given to a similar proportion of r-carrying gametes. 
The total number (679+679) is 1358. The difference between 1515, 
the total number of gametes, and 1358 is 157, which is the number of the 
remaining recessive gametes (or kernels). It is easy to see by this method 
that this number (157) represents one-third of the total number of gametes 
produced as the result of lagging and loss (or lagging and loss plus 
megaspore selection). The number 157 is also the amount by which the 
recessives exceed the dominants. Thus, in a population composed of 
individuals, some of which, taken as a group, represent a 1:1 ratio and others 
of which, similarly, represent a 1:2 ratio, the number by which the 
recessives in the total population exceed the dominants should equal 
one-third of the individuals representing the 1:2 ratio, or, in the present 
instance, one-third of the individuals produced as a result of lagging. 

The number of colorless kernels (recessives) exceeds the number of 
colored kernels (dominants) by 157; hence, the total number of gametes 
produced after loss should be three times 157, or 471. This number repre- 
sents approximately 30 percent of the total number of kernels. Assuming 
that the data represent a sufficiently uniform random sample, it can be 
concluded that approximately 30 percent of the gametes were produced 
after loss of the extra chromosome through lagging at the first or second 
meiotic mitosis, or possibly through lagging and loss plus any loss that 
may have occurred through selective viability of megaspores. 
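
The arithmetic of this estimate can be restated compactly. The sketch below is only an illustration of the reasoning given above, assuming that the kernels are a mixture of a 1R:1r class (random assortment) and a 1R:2r class (lagging and loss); the function name is hypothetical.

```python
def loss_fraction(colored, colorless):
    """Estimate the share of female gametes formed after loss of the extra chromosome.

    In a mixture of a 1:1 class and a 1:2 class, the excess of recessives
    over dominants equals one-third of the 1:2 class.
    """
    total = colored + colorless
    excess = colorless - colored   # recessives in excess of dominants
    after_loss = 3 * excess        # size of the 1R:2r class
    return excess, after_loss, after_loss / total

excess, after_loss, fraction = loss_fraction(679, 836)
print(excess, after_loss, round(100 * fraction, 1))  # 157 471 31.1
```

The computed value, about 31 percent, is the figure rounded in the text to approximately 30 percent.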

If 30 percent of the gametes were produced through such loss, then 
loss occurred in 30 percent of the ovules. From the remaining 70 per- 
cent of the ovules half, or 35 percent, should possess n+1 female gametes 
and consequently 35 percent of the plants from the cross 2n+1 ♀ × 2n ♂ 
should carry the extra chromosome. This, in turn, can well explain the 
deviation from the expected ratio of 2n+1 to 2n individuals in the cross 
2n+1 × 2n (see page 177 and table 2). It is expected that both cytological 
and genetical evidence will be obtained which will indicate more definitely 
how much of the deviation is actually due to lagging and loss, and how 
much may be due to a possible selective viability of n+1 gametes or 
2n+1 embryos. 
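
As a rough check of this argument, the expected proportion of 2n+1 plants can be set beside the totals of table 2. This is a sketch under the assumption stated in the text, namely that 30 percent of the ovules lose the extra chromosome and that the remainder give n and n+1 eggs in equal numbers.

```python
loss = 0.30                                  # assumed fraction of ovules with loss
expected_trisomic = (1 - loss) / 2           # half of the remaining eggs are n+1

observed_2n, observed_2n1 = 83, 41           # totals of table 2
observed_trisomic = observed_2n1 / (observed_2n + observed_2n1)

print(round(100 * expected_trisomic, 1), round(100 * observed_trisomic, 2))
```

The expectation of 35 percent lies close to the 33.06 percent recorded in table 2.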

Assuming this same amount of loss, deviations from the 5R:1r ratio 
expected in the cross RRr × rr (table 10) can be interpreted. 

TABLE 10 
RRr ♀ × rr ♂                  COLORED     COLORLESS 
209411929 (first ear)           130          33 
2094: 192s (second ear)         207          56 
20941933 (tiller ear)           193          52 
20958 X 192¢                    289          72 
Totals                          819         213 

Each individual cross shows a deviation from the 5:1 ratio in an increase in the 
number of colorless kernels over expectancy. The total count of 819 
colored: 213 colorless represents a ratio of 3.84:1 instead of 5:1. A ratio 
of 3.61:1 would be expected if it were assumed that 30 percent of the 
female gametes were produced as the result of loss. The ratio due to loss 
is 2R:1r, as contrasted with the ratio of 5R:1r produced through random 
assortment of the three homologous chromosomes. 

These same assumptions are utilized in explaining the deviations ob- 
served in selfing an individual of the constitution RRr. On the basis of 
random assortment and no lagging of the three homologous chromosomes, 
the female gametic ratio should be 2R:2Rr:1RR:1r, the male, because 
of the elimination of the extra chromosome carrying pollen, 2R:1r. One 
would expect, therefore, a 17:1 ratio in F2. On the basis of 30 percent of 
the gametes being produced as the result of loss of the extra chromosome 
the gametes being produced as the result of loss of the extra chromosome 
a 12.8:1 ratio would be expected instead. Table 11 shows the results 
obtained. 
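
The expected ratios of 3.61:1, 17:1 and 12.8:1 for a duplex (RRr) plant follow from mixing the two kinds of female gametes. The sketch below is one way of reproducing them, assuming 30 percent of the eggs arise after loss (2R:1r), the remainder by random assortment (5R:1r), and that functioning pollen is 2R:1r; the names are illustrative.

```python
from fractions import Fraction

def egg_recessive(loss, random_rec, loss_rec):
    """Fraction of eggs carrying only r, mixing random assortment with loss."""
    return (1 - loss) * random_rec + loss * loss_rec

loss = Fraction(30, 100)                                     # assumed share of eggs formed after loss
egg_r = egg_recessive(loss, Fraction(1, 6), Fraction(1, 3))  # 5R:1r mixed with 2R:1r
pollen_r = Fraction(1, 3)                                    # n pollen of RRr: 2R:1r

backcross = (1 - egg_r) / egg_r                              # RRr female x rr male
selfed = (1 - egg_r * pollen_r) / (egg_r * pollen_r)         # RRr selfed, with loss
selfed_no_loss = (1 - Fraction(1, 6) * pollen_r) / (Fraction(1, 6) * pollen_r)

print(round(float(backcross), 2), round(float(selfed), 1), float(selfed_no_loss))
```

The three values correspond to the 3.61:1 backcross ratio, the 12.8:1 selfed ratio and the 17:1 ratio expected without loss.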











TABLE 11 
AACCRRr selfed        COLORED     COLORLESS 
2319 first ear          396          41        χ²=2.47, P=0.10+ 
2319 second ear         188          24 
Totals                  584          65        χ²=7.42, P=>0.01 











Determinations of goodness of fit by means of the χ² method indicate 
a significant deviation on the total counts. A full ear of well developed 
kernels of unmistakable classification resulted upon selfing the first ear. 
A χ² determination on this ear alone gives a fit well within the probability. 
The second ear on this plant produced by selfing was poorly filled, with 
many kernels underdeveloped. It is possible that in this ear the color in 
some of the kernels did not develop. This possibility is supported by the 
fact that some kernels on this ear showed the presence of color by only 
a slight degree of mottling. It is possible, also, that some 2n+1 embryos 
did not develop fully on this particular ear. 
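
A goodness-of-fit determination of this kind can be sketched as follows. This is an illustration only: it assumes the expected ratio is exactly 12.8:1 and uses one degree of freedom, so the values it gives need not agree to the last figure with those printed in table 11.

```python
import math

def chi_square_two_class(observed, ratio):
    """Chi-square and upper-tail probability (1 d.f.) for two classes."""
    total = sum(observed)
    expected = [total * part / sum(ratio) for part in ratio]
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    p = math.erfc(math.sqrt(chi2 / 2))   # tail area of chi-square with 1 d.f.
    return chi2, p

results = {}
for label, counts in [("first ear", (396, 41)), ("totals", (584, 65))]:
    results[label] = chi_square_two_class(counts, (12.8, 1))
    print(label, round(results[label][0], 2), round(results[label][1], 3))
```

On these assumptions the total count deviates beyond the 0.01 level, whereas the first ear alone does not.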

The description of trisomic inheritance of r given above shows the nature 
of trisomic inheritance in Zea mays with regard to the smallest chromo- 
some of the haploid set. 


INDEPENDENCE OF THE R-G LINKAGE GROUP 


The method of trisomic inheritance is a convenient means of determin- 
ing with certainty the independence of linkage groups. Evidence obtained 
from both cytological and genetical observations indicates that the r-g 
linkage group is independent of all the other nine linkage groups 
established genetically. At least one factor of each linkage group has been tested 
(c and wx, su, b, y, gl, pr, f, d and a). 2n+1 individuals heterozygous for 
these genes have been selfed and backcrossed. 


TABLE 12 
Results of crosses involving c-wx, su, b, y, gl, pr, d and a among 2n+1 individuals trisomic for 
the r-g chromosome. For explanation see page 189. 

I. Disomic inheritance of c and wx (see appendix). 

2n+1 Cc [C or c] × 2n AcR             C        c 
13133 X B346;                         78       69 
2n AcR × 2n+1 Cc [C or c] 
104288;                              194      149 
B346; X 131;                         172      184 
                                      Wx       wx 
2n+1 Wxwx [Wx] pollen counts         535      549 
2n Wxwx pollen counts                633      594 
2n wxwx × 2n+1 Wxwx [Wx] 
1923225;                             152      179 

II. Disomic inheritance of su. 

2n+1 Susu [su] selfed                 Su       su 
220;                                 282      110 
22014                                279       97 
22014                                174       53 
2205                                 223       68 
22017                                261       90 
22017                                221       86 
2319                                 318      102 
Totals                              1758      586 
2n Susu × 2n+1 Susu [su] 
22011 X18                            228       98 
22514X2                              208       73 
2317 X9                              324      112 
231isX9                              316      126 
231i7X9                              359      139 
Totals                              1495      548 
2n susu × 2n+1 Susu [Su] 
188 X189Bis                          190      169 
2n Susu selfed 
220;                                 288       99 
220s                                 269       77 
22012                                352      112 
22510                                278       95 
231;                                 420      138 
231is                                327      112 
Totals                              1934      633 

III. Disomic inheritance of b (see page 189). 

2n+1 Bb [b] selfed                    B        b 
88,                                   54       19 

IV. Disomic inheritance of y (see appendix). 

2n+1 Yy [Y or y] selfed               Y        y 
176¢                                  94       36 
2n Yy × 2n+1 Yy [y] 
194). 232:                           139       37 
194,a X 232,                         113       40 
194452324                            136       46 
194; X2325¢                          102       38 
19413 X 232.                         128       46 
194; X232s                           127       40 
19419 X 23213                        143       52 
Totals                               888      299 
2n yy × 2n+1 Yy [y] 
194, X232n                            95       89 
1945 X2321                            85       63 
1941623213                            77       72 
1933 X23216                           66       74 
Totals                               323      298 

V. Disomic inheritance of gl. 

2n+1 Glgl [Gl] × 2n glgl              Gl       gl 
229; X 201.                          113      105 

VI. Disomic inheritance of pr. 

2n+1 Prpr [Pr] selfed                 Pr       pr 
2203                                 149       45 
22014                                287       89 
22014                                170       57 
22017                                228       87 
22017                                196       78 
Totals                              1030      356 
2n+1 Prpr [Pr] × 2n prpr 
22015 X 203;                         129      120 
22019 X 2033                         110      108 
Totals                               239      228 
2n Prpr × 2n+1 Prpr [Pr] 
22011 X 22018                        306       80 
2n Prpr selfed 
2205                                 271      116 
220s                                 249       97 
22012                                362      102 
Totals                               882      315 
2n Prpr × 2n prpr 
22013 X 203                          194      207 

VII. Disomic inheritance of d (see appendix). 

2n+1 Dd [D] selfed                    D        d 
231,                                 129       44 
2n dd × 2n+1 Dd [D] 
205 X231y                            119      118 
2n+1 Dd [D] × 2n dd 
23112205                             170      140 

VIII. Disomic inheritance of a (see appendix). 

2n+1 Aa [A or a] selfed               A        a 
88:                                   73       19 
2n+1 Aa [A or a] × 2n aa 
131;X R511;                           80       53 
2n aa × 2n+1 Aa [A or a] 
2066 X 189 Beso                      144      173 


It is needless to discuss every cross represented in table 12, for the 
results are self explanatory. The crosses involving the recessive sugary 
gene (su) can be used as a single example (see table 12, section II). A 2n+1 
plant of culture 131, homozygous for sugary, was crossed with pollen 
from a 2n starchy plant (SuSu). The F1 2n and 2n+1 individuals were 
selfed and backcrossed to test for trisomic or disomic inheritance. In 
each section the type of cross is indicated. The symbol of the gene placed 
in brackets indicates how the genic constitution of the 2n+1 individual 
would have differed had it been trisomic for this gene. On selfing 2n+1 
plants heterozygous for sugary a total of 1758 Su to 586 su kernels were 
obtained, precisely a 3:1 ratio. Diploid (2n) sibs upon selfing gave a 
total count of 1934 Su:633 su kernels. The two ratios are similar and 
disomic. If these 2n+1 plants were trisomic for sugary (Sususu), an 
approach to a 2:1 ratio would be expected. Similarly, a 2:1 ratio would be 
expected in the sib crosses 2n (Susu) × 2n+1. Here a ratio of 1495 Su:548 
su kernels was obtained, a 3:1 instead of a 2:1 ratio. It is therefore 
concluded that the linkage group including su is independent of the r-g 
linkage group and must be associated with another chromosome. The 
data on factors c, wx, y, gl, pr and d are sufficiently numerous to need 
no further explanation. 
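
The contrast between the disomic 3:1 and the trisomic 2:1 expectation for sugary can be made explicit by computing χ² against each. This is a minimal sketch using the totals quoted above; the helper function is illustrative and is not taken from the original analysis.

```python
def chi_square(observed, ratio):
    """Chi-square of two observed classes against an expected ratio."""
    total = sum(observed)
    expected = [total * part / sum(ratio) for part in ratio]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

crosses = {
    "2n+1 Susu selfed": (1758, 586),
    "2n Susu x 2n+1 Susu": (1495, 548),
}
results = {}
for name, counts in crosses.items():
    # First value: fit to the disomic 3:1; second: fit to the trisomic 2:1.
    results[name] = (chi_square(counts, (3, 1)), chi_square(counts, (2, 1)))
    print(name, [round(value, 2) for value in results[name]])
```

In both crosses χ² is small against 3:1 and very large against 2:1, which is the basis for regarding sugary as inherited on a disomic basis in these plants.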

In the case of b, genetic data are hardly necessary, since the b-lg linkage 
group has been associated with another chromosome. 

The data on the a factor are few but indicate a disomic instead of a 
trisomic inheritance. In the case of a 2n+1 heterozygous a plant selfed 
the results (73A:19a) indicate neither a duplex (AAa) 17−:1 ratio nor a 
simplex (Aaa) 2:1 ratio, but better, a disomic 3:1 ratio. Further, the cross 
of a heterozygous 2n+1 plant × 2n aa would have given, if duplex, 5A:1a 
or if simplex, 1A:1+a. 80A:53a probably represents a 1:1 disomic ratio. 
In the case of 2n aa × 2n+1 heterozygous a, a simplex constitution would 
have given a 1:2 ratio and a duplex constitution a 2:1 ratio. 144A:173a 
approaches neither of these but probably represents a 1:1 disomic 
backcross ratio. 

The factor for fine stripe (f) did not segregate sharply in the seedling 
stage so that backcross counts were not sufficiently reliable. Cytological 
evidence from Doctor BRINK's material indicates that the f-), linkage 
group is carried by a long chromosome. 


SUMMARY 


1. A 2n+1 plant of Zea mays resulting from the cross diploid X triploid 
and its 2n+1 progenies were found to give trisomic inheritance for r. 

2. In these plants the smallest chromosome is present in triplicate. 

3. Two unrelated 2n+1 individuals were found to be trisomic for the 
smallest chromosome of the haploid set. These plants, upon later testing, 
gave trisomic inheritance for r. 

4. In 2n+1 individuals one-third of the eggs carry the extra chromo- 
some. In a normal pollination the extra chromosome-carrying pollen 
grains function only in a small percentage of the cases. 

5. Plants trisomic for the r-g linkage group have given disomic 
inheritance for c, wx, su, b, y, gl, pr, d and a. 

APPENDIX 


During the time this paper was in press the following linkage groups 
were found to be associated with chromosomes other than the r-g carrying 
chromosome: C-sh-wx, Y-Pl, A-d1-cr. By the method of association 
of linkage groups with particular chromosomes the independence of six of 
the ten linkage groups (C-sh-wx, R-g, B-lg, Y-Pl, P-br, A-d1-cr) 
has been definitely established. 